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Washington and a Ph.D. in Organic Chemistry in 1970 from the University of California at 
Berkeley. 

2 . I am the Walther Professor of Medicinal Chemistry at Purdue 
University, West Lafayette, Indiana. 

3. I am submitting this declaration to demonstrate that (1) on or before 
May 31, 1994 to June 2, 1994, Francis Barany, George Barany, and Robert Hammer 
conceived the idea of using an array of a plurality oligonucleotide analogue probes coupled to 
a solid support where the probes bind to complementary oligonucleotide targets with similar 
hybridization stability across the array and (2) on or before May 31, 1994 to June 2, 1994, 
this conception was communicated from those scientists to an employee of Affymetrix, Inc., 
Santa Clara, California. 



Dear Sir: 



I, DONALD BERGSTROM, pursuant to 37 CFR § 1 .608(b), declare: 
1 . I received a B.S. in Chemistry in 1965 from the University of 
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4. On or before February 4, 1 994, a grant application, entitled "New 
Methods for Cancer Detection", ("Grant Application") was submitted to the National Cancer 
Institute, U.S. Department of Health and Human Services ("NCI") by, amongst others, me, 
Francis Barany, George Barany, and Robert P. Hammer. This describes five (5) projects, 
including Project 5, entitled "Design and Synthesis of DNA and PNA Arrays"; A copy of 
relevant portions of the Grant Application, including the Project 5 description is attached 
hereto at Appendix 1 . It is my understanding that the portions of the Grant Application 
which are attached at Appendix 1 are an accurate version of what was submitted to the NCI 
on or before February 4, 1994. 

5. Project 5 describes a method of detecting nucleic acid sequence 
differences in target nucleic acids by the use of a solid support with an array of peptide 
nucleotide analogues ("PNA") to capture and detect the products of a ligase detection 
reaction ("LDR") (Appendix 1, p. 19-22). In particular, each ligation product of the LDR 
process is provided with a "zip code" tail (e.g., Zl and Z2) which is selectively captured by a 
complementary "zip code" located at a particular "address" on the solid support (Id.). As 
shown below, different target nucleic acids captured in this manner are detected and 
distinguished from one another by the presence of labels immobilized at different addresses 
on the array (having "zip code" capture probes with different nucleotide sequences) or by the 
presence of different labels immobilized on the array (Id. at 22). 7 - ... ~ ........... t 
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The Grant Application discloses that the array "zip code" capture probes on the solid support 
can be designed to achieve optimal hybridization to the "zip code" tail on the ligation product 
(Id.). This is achieved by increasing thermal stability through reduction of the difference in 
melting temperature (i.e. T m ) between the different duplexes formed by LDR products having 
a "zip code" tail and the complementary "zip codes" (i.e. PNA oligonucleotides).hybridized 
to one another on the solid support (Id.). These differences in melting temperature result 
from differences in G*C/A*T content (Id.). The Grant Application states that the T m for 
correct PNA/DNA hybridization is at least 48°C higher than for any incorrect hybridization 
and neighboring 24-mer capture probes on the solid support are separated by 12-mer 
oligonucleotides which do not hybridize to anything (Id. at 31). The Grant Application also . 
states that analogues with thymine replaced with 5-propynyl uridine can be used in either 
complementary zip codes (DNA or PNA) on the solid support or zip code (DNA) tails on the 
ligation product (Id. at 24). 

6. In preparing the above-described Grant Application, I had the occasion 
to participate in discussions with Francis Barany, George Barany, and Robert P. Hammer and 
recall the features of the invention as described above in ^ 5. By the time that the Grant 
Application was prepared, Francis Barany, George Barany, and Robert P. Hammer had a firm 
idea of the invention as described above. 

7. On May 31,1 994, to June 2, 1 994, the NCI Site Special Review 
Subcommittee for the Grant Application visited Cornell University Medical College, New 
York, New York to meet with the scientists who submitted the Grant Application, including 
me, Francis Barany, George Barany, and Robert P. Hammer. As indicated in the follow-up 
July 20-22, 1994, Draft Review Report (attached hereto at Appendix 2) regarding the Grant 
Application, the NCI Site Special Review Subcommittee included Steven P.A. Fodor, Ph.D., 
Scientific Director and Chief Technical Officer of AfTymetrix, Inc., Santa Clara, California 
(Appendix 2 at p. 32), who I remember being in attendance at the site visit. Dr. Fodor's 
presence at the site visit (as well as on the review subcommittee) caused a great deal of 
concern amongst me and the other grant applicants. Having served on a number of NCI site 
review subcommittees, it is my understanding that subcommittee members should not review 
grant applications where they have a conflict of interest or the appearance of one. For 
example, where a subcommittee member has a financial interest in work closely related or 
competing with the subject matter of the grant application, a conflict of interest arises and 
that subcommittee member should not participate in deliberations and actions on the grant 
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application. Since it was well known at the time of the site visit that Asymetrix, Inc. was 
developing oligonucleotide arrays for the use of detecting single base mutations, I and other 
grant applicants wondered why Dr. Fodor did not remove himself from the review 
subcommittee. 



presence at the site visit to NCI; however, with assurance by NCI that it would guard against 
conflicts of interest problems, Dr. Fodor remained on the review subcommittee. I continued 
to be concerned about Dr. Fodor's presence on the subcommittee and, after learning of work 
subsequently presented by Affymetrix scientists, sent a letter with Francis Barany on 
November 11, 1994, to Dr. James Jacobson of NCI (attached hereto at Appendix 3), in which 
we again protested the presence of Dr. Fodor on the site review subcommittee on the grounds 
that he may have had a conflict of interest. 



are true and that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under section 1001 of 
Title 18 of the United States Code, and that such willful false statements may jeopardize the 
validity of the application or any patent issuing thereon. 



8. 



Prior to the site visit I expressed my concern about Dr. Fodor's 



9. 



I hereby declare that all statements made herein of my own knowledge 



Date: 





Donald Bergstrom 
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(To Declaration of Donald Bergstrom under 37 CFR § 1.608(b)) 
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DEPARTMENT OF HEALTH AND HUMAN SERVICES 
PROGRAM PUBLIC HEALTH SERVICE 

PROJECT GRANT APPLICATION 
Follow instructions carefully. Type in the unshaded areas only, 
Type density must be 10 c.p.l. 



LEAVE BLANK FOR PHS USE ONLY. 



1. TITLE OF PROJECT (Qo not exceed 56 typewriter spaces.) 
NEW METHODS FOR CANCER DETECTION 



2a. RESPONSE TO SPECIFIC REQUEST FOR APPLICATIONS OfMtaOGRAM ANNOJfiCEMENT 

IA2&H 



Number: 



Trtle: CONFI DENT I A 



2b. TYPE OF GRANT PROGRAM P01 




Form Approveo 1 nrougn o.Ju. 
OMB No, 0925-OOui - 



Activity 



'Month, Year) 



Number 



Formerly 



Date Received 



3a. NAME (Last, first, middle) 
BARANY, FRANCIS 



3b. DEGREE(S) 
PH.D. 



3d. POSITION TITLE 
ASSOCIATE PROFESSOR 



3f. DEPARTMENT, SERVICE, LABORATORY, OR EQUIVALENT 

MICROBIOLOGY 



3g. MAJOR SUBDIVISION 

CORNELL UNIV. MEDICAL COLLEGE 



3h. TELEPHONE AND FAX (Area code, number and extension) 
TEL: 212 746-6509 

FAX: 212 746-8587 



4. HUMAN SUBJECTS 



4a. 
NO 



If -Yes. - 

exemption no. or 



IRB 

approval 
date 



4b. Assurance of 
compliance no. 



ID NO □ YES (If "YES, * state number 
m and title) 



CIPAL INVESTIGATOR/PROGRAM DIRECTOR 



3c. SOCIAL SECURITY NO. 
REDACTED 



3e. MAILING ADDRESS (Street, city, state, zip code) 
CORNELL UNIV. MEDICAL COLLEGE 
1300 YORK AVENUE 
NEW YORK, NY 10021 



BITNET/INTERNET ADDRESS BARANY@CUMC ■ CORNELL . ED T 



5. VERTEBRATE ANIMALS 



YES 



X 



5a. 






NO 




YES 



If "Yes." 

1ACUC approval date 



5b. Animal welfare 
assurance no. 



6. DATES OF ENTIRE PROPOSED PROJECT 
PERIOD 



From (MMDDYY) 

120194 



Through (MMDDYY) 

113099 



BUDGET PERIOD 
7a. Direct Costs ($) 

$971,041 



7b. Total Costs ($) 
lgl,224,862 



8. COSTS REQUESTED FOR ENTIRE 
PROPOSED PROJECT PERIOD 

8a. Direct Costs ($) 8b. Total Costs ($) 

$5,295,869 $6,995,293 



» , * 

10. INVENTIONS AND PATENTS (Competing continuation application only) 



9. PERFORMANCE SITES (Organizations and addresses) 

CORNELL UNIV. MEDICAL COLLEGE 
(1300 YORK AVENUE 
NEW YORK, NY 10021 



NO 



If 

YES "YES," 



Previously 
reported 



Not previously 
reported 



1 1 . NAME OF APPLICANT ORGANIZATION 

CORNELL UNIVERSITY MEDICAL COLLEGE 
address 1300 YORK AVENUE 

NEW YORK, NY 10021 



12. TYPE OF ORGANIZATION 

□ Public: Specify □ Federal □ State □ Local 
K Private Nonprofit 

□ Forprofit (General) □ Forprofrt (Small Business) 

15. NAME OF ADMINISTRATIVE OFFICIAL TO BE NOTIFIED IF AWARD 
PHILIP V. GIUCA IS MADE 

TELEPHONE 212 746-6036 

fax 212 746-8745 

"tle SENIOR ASSOCIATE DEAN 

address CORNELL UNIV. MEDICAL COLLEGE 

1300 YORK AVENUE 

NEW YORK, NY 10021 



3ITNET/INTERNETADDRESSQMCUMC . MAIL . CORNELL . EDU 



13. ENTITY IDENTIFICATION NUMBER 

I1131623978A1 



Congressional District 

14 



14. BIOMEDICAL RESEARCH SUPPORT GRANT CREDIT 
CoderOl IdentfficationSCHOOL OF MEDIdTWR 



16. NAME OF OFFICIAL SIGNING FOR APPLICANT ORGANIZATION 

GREGORY W. SISKIND 
telephone 2 12 746-6020 

212 746-8745 
ASSOCIATE DEAN 

CORNELL UNIV. MEDICAL COLLEGE 
1300 YORK AVENUE 
NEW YORK, NY 10021 



FAX 
TITLE 
ADDRESS 



17. PRINCIPAL INVESTIGATOR/PROGRAM DIRECTOR ASSURANCE: 1 agree to accept respon- 
sibility for the scientific conduct of the project and to provide the required progress reports if a grant 
is awarded as a result of this application. Willful provision of false information is a criminal offense 
(U.S. Code, Title 18. Section 1001). I am aware that any false, fictitious, or fraudulent statement 
may, in addition to other remedies available to the Government, subject me to civil penalties under 
the Program Fraud Civil Remedies Act of 1 986 (45 CFR 79). 


SIGNATURE OF PERSON NAMED IN 3a. 
(In ink. "Per signature not acceptable.) 


DATE 


1 8. CERTIFICATION AND ACCEPTANCE: I certify that the statements herein are true and complete 
to the best of my knowledge, and accept the obligation to comply with Public Health Service terms 
and conditions if a grant is awarded as the result of this application. A willfully false certification is 
a criminal offense (U.S. Code, Title 18, Section 1001). I am aware that any false, fictitious or 
fraudulent statement may, in addition to other remedies available to the Government, subject me- 
to civil penalties under the Program Fraud Civil Remedies Act of 1986 (45 CFR 79).' 
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BB Principal Investigator/Program Director (Last first, middle): BARANY. Francis 

Dbii-KU- lluN: Siaie i/ie application s broad, long-tern, objectives and specitic urns, making reference ,o tbe health rclaicdncss oi the project. Describe concisely the research 
design and methods for achieving these goals. Avoid summaries of past accomplishments and the use of the first person. This abstract is mean.To " m u a ^succi^ct^d 
accurate description of the proposed work when separated from tbe application. DO NOT EXCEED THE SPACE PROVIDED. succinci ana 



The long range objective of this proposal is to develop sensitive and specific approaches to the detection 
and simultaneous identification of cancer-related, genetic alterations. Mutations and genetic aberrations have 
been implicated, at various steps, in the etiology and biology of tumors. Inherited mutations account for the 
predisposition to cancer in some families. Somatic mutations in tumor suppressor genes, oncogene 
amplification and viral DNA sequences have been found in cancers as well. However, the clinical use of these 
discoveries and research into their clinical significance has been slowed by the laborious processes by which 
they are detected. To apply these discoveries and explore the interactions of multiple genetic alterations, we 
urgently need a new technology, which is capable of being automated and has the power to detect any of a vast 
number of mutations. 

In response to the urgent need for new methods of mutation detection, we have assembled a team of 
investigators whose expertise will be directed toward innovative solutions to this problem. The collaborative 
nature of the scientific and organizational infrastructure will facilitate the attainment of the projects' specific 
aims and objectives. 

The specific aims of the five projects in this proposal are to: (i) develop a multiplex polymerase chain 
reaction/hgase detection reaction (PCR/LDR) system for the detection of inherited mutations in germline DNA 
and somatic mutations in tumors; (ii) develop a ligase detection reaction/ polymerase chain reaction 
(LDR/PCR) system for detecting gene amplifications and deletions in tumors; (iii) develop a PCR/restriction 
enzyme/LDR (PCR/RE/LDR) system for detecting and identifying mutations in rare cancer cells at a sensitivity 
of 1 in 106 or 1 in 10 7 by removing normal DNA sequences and selectively amplifying cancer mutations- 
(lv) design and synthesize nucleotide analogues for converting specific DNA sequences into restriction 
endonuclease recognition sites for PCR/RE/LDR mutation detection; (v) engineer a thermostable ligase with 
greater fidelity to enhance LDR and LCR specificity; (vi) design and synthesis oligonucleotide or peptide 
nucleic acid (PNA) addressable arrays for the simultaneous detection of multiplex LDR and LCR products- 
and (vu) explore the ability of these technologies to further our understanding and clinical management of 
lung, colon, breast and cervical cancers. 



PERSONNEL ENGAGED ON PROJECT, INCLUDING CONSULT ANTS/COI I A R orators « 

information in the format shown below on off inclividnair^ci^ inrne Ii/LOL ^°^ TOR S. U« co nlmua ,u,n page, as n»ded to provide the required 



Name 


AGGARWAL, Aneel 


Degree(s) Ph.D. 


Position Title 


Assistant Professor 


D.O.B. REDACTED 


Organization 


College of Physicians & Surgeons of Columbia University 


Name 


AHNHN, Dennis 


Degree(s) M.D. 


Position Title 


VA Clinical Investigator /Associate Professor 


D.O.B. REDACTED 


Organization 


Veterans Affairs Medical Center & Univ. Colorado School Of Medicine 


Name 


BARANY, Francis 


Degree(s) Ph.D. 


Position Title 


Associate Professor 


D.O.B. REDACTED 


Organization 


Cornell Univeristy Medical College 




Name 


BARANY, George 


Degree(s) Ph. D. 


Position Title 


Professor 


D.O.B. REDACTED 


Organization 


University of Minnesota 




Name 


BATT.Carl 


Degree(s) Ph.D. 


Position Title 


Associate Professor 


D.O.B. REDACTED 


Organization 


Cornell University 




Name 


BERGSTROM, Donald 


Degree(s) Ph.D. 


Position Title 


Professor 


D.O.B. REDACTED 


Organization 


Purdue Univ. School of Pharmacy &Phann. Sciences 


Name 


BUNK, Michael 


Degree(s) Ph.D. 


Position Title 


Director, Research Management 


D.O.B. _ REDACTED 


Organization 


Strang Cancer Prevention Center 





Social Security # REDACTED 
Role on Project Co-investigator 
Department Bioc h em & Biophys 
Social Security # REDACTED 
Role on Project Collaborat or 
Department Medicine 



Social Security # REDACTED 
Role on Project Prin. Investie. 
Department Microbiology * 
Social Security # REDACTED 



Role on Project Co-investigator 
Department Chemistry 



Social Security # _ redacted 
Role on Project Collaborator 
Department Food Science 



REDACTED 



Social Security # 

Role on Project Co-investigator 
Department Medicinal Chem. 



Social Security # REDACTED 
Role on Project Administrator 
Department Res. Management 
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Organization 
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Name 


CUlrlbKJN, Melissa 
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Degree(s) B.S. 


Position Title 


Graduate Student 
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Organization 


Louisiana State University 




Name 


CUULL, James 


Degree(s) Ph.D. 


Position Title 


Group Manager of Specialty Chemistry 
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Organization 
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Name 
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Degree(s) M. D. 


Position Title 


Professor, Associate Chairman 


D.O.B. REDACTED 


Organization 
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Name 
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Research Associate 
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Name 
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Director of Research 
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Name 
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Professor, Director of Tissue Bank 


u.u.d. REDACTED 


Organization 
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Name 
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Organization 
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Name 
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Organization 
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Name 
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Name 
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Name 
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Name 
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Degree(s) Ph.D. 


Position Title 


Assistant Professor 
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Organization 
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Name 
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Position Title 


Assistant Professor 
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Organization 


Boston University School of MeHirinp 




Name 
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Assistant Professor 
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Name 


KfcNNEDY, Timothy 


Degree(s) M.D. 
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Medical Director 
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Organization 
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Name 
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Role on Project 
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Social Security # REDACT ED 
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DESCRIPTION * State the application's broad, long-term objectives and specific aims, making reference to the health retatedness of the project. Describe concisely the 
«eoarrh rfpsian and methods for achieving these goals; "Avoid summaries of -past-accomplishments and the use of the first person. This abstract is meant to serve as a 
succinct am I accurate description of the proposed work when separated from the application. DO NOT EXCEED THE SPACE PROVIDED. 



The goal of this program project is to develop methods for identifying multiple gene mutations in 
cancers. For maximum utility, these methods must be able to recognize and discriminate between dozens or 
hundreds of mutations. 

To accomplish this, we propose to capture specific ligase detection reaction (LDR) products on a 
spatially addressable array, such that the position of a signal identifies a mutation. Each LDR product will 
have a "zip code" tail, which will be selectively captured by a "complementary zip code" on a solid support. 
The complementary componenets can be DNA oligonucleotides or peptide nucleotide analogues (PNA). 
PNA/DNA hybrids have significantly higher T m values than DNA/DNA hybrids. Incorporation of the 
nucleotide analogue, 5-propynyluridine, into DNA zip code and PNA address sequences will further 
increase and optimize T m values (Project 3). Unreacted LDR primer may therefore be washed away at high 
temperatures allowing for a higher sensitivity in detecting LDR products. A reusable, universal addressable 
array could be used for detecting a wide range of cancer mutations, genetic diseases and infectious agents. 

Implementation of these concepts, with the ultimate goal of achieving reliable and efficient materials 
and procedures that can be incorporated into easy-to-use, automated, low-cost diagnostic devices, will 
follow these aims: (i) Development and evaluation of solid support materials compatible with chemical 
synthesis of DNA oligonucleotides and PNA oligomers, and compatible with subsequent hybridization 
reactions. Surfaces, beads, or membranes will be functionalized, and extended as needed with hydrophilic 
spacers such as heterobifunctional polyethylene glycol (PEG) and/or carbohydrates. Chemistry for linking 
oligomers to the solid support, and/or solid-phase assembly of oligomers, will be developed, (ii) 
Establishment of methodology for synthesis of spatially addressable arrays of DNA oligonucleotides and 
PNA oligomers. Appropriate masking technology will expose defined regions of the solid support for 
attachment of pre-formed oligomers, or for chain elongation to assemble the needed oligomers. In the latter 
mode, segment condensation will be used when possible in order to provide efficient convergent synthesis, 
and because chemical J_*failures" will become "invisible" during the subsequent hybridization, (iii) 
Demonstration of scope and limitations of zip code concepts. As aims (i) and (ii) come to fruition, testing 
will be carried out (Core B). Design of primer and zip code structures will be facilitated by the informatics 
collaboration (Core A). " ~ " 



PERSONNEL ENGAGED ON PROJECT, INCLUDING CONSULTANTS/COLLABORATORS. Use continuation pages as needed to provide 
the required information In the format shown below on of individuals participating in the project. 



Name 


BARANY, George 


Degree(s) Ph.D. 


Social Security # REDACTED . 


Position Title 


Professor 


D.O.B. REDACTED 


Role on Project Prin. Investie. 


Organization 


University of Minnesota 




Department Chemistry 


Name 


HAMMER, Robert 


Degree(s) Ph.D. 


Social Security # REDACTED 


Position Title 


Assistant Professor 


D.O.B. RPDACTED 


Role on Project Co-investieator 


Organization 


Louisiana State University 




Department Chemistry 


Name 
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Degree(s) Ph.D. 


Social Security # redacted 


Position Title 


Postdoctoral Associate 
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Role on Project 
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University of Minnesota ' 




Department Chemistry 


Name 


VAGNEROVA, Lydie 


Degree(s) B.S. 


Social Security # pending 


Position Title 


Research Technician 


D.O.B. RPDACTED 


Role on Project 


Organization 


University of Minnesota 




Department Chemistry 


Name 




Degree(s) 


Social Security # 


Position Trtle 




D.O.B. 


Role on Project 


OrgariSaffdnF 






Department 


Name 




Degree{s) 


Social Security # 


Position Title 




D.O.B. 


Role on Project 


Organization 






Department 
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IX) Principal Investigator/Program Director (Last, 
DETAILED BUDGET FOR INITIAL BUDGET PERIOD 
DIRECT COSTS ONLY 


first, middle]" 
rHUM 

94/12/01 


THROUGH 
95/11/30 


ppRsnNNFL fAnnlicant Organization Only) 


TYPE 
APPT. 
(months) 


% 

B+CRT 
PBOJ. 




DOLLAR AMO 


UNT REQUEST 1 


;D (omit cents) 


NAME 


ROLE ON 
PROJECT 


INST. 
BASE 
SALARY 


SALARY 
REQUESTED 


FRINGE 
BENEFITS 


TOTALS 


George Barany (AY) 


Principal 
Investigator 


9 


5 






George Barany (SS) 


Principal 
Investigator 


3 


16.7 




Josef Vagner 


Post-Doc 
Associate 


12 


100 




Lydie Vapnerova 


Research 
Technician 


12 


100 


























































PROJI 


ECT 5 — 






























SUBTOTALS -> 


$55,623 


$8,446 


$64,069 1 



CONSULTANT COSTS 



EQUIPMENT (Itemize) 



SUPPLIES (Itemize by category) 
Chromatography $3,000 
Chemicals $5 ,000 
Special Solvents & Reag $7,500 
for PNA synthesis 



TRAVEL 

One trip per year for P.I, to present results 



$1,200 




PATIENT CARE COSTS 



INPATIENT 



OUTPATIENT 



ALTERATIONS AND RENOVATIONS (Itemize by category) 



OTHER EXPENSES (Itemize by category) 
See following page $5,000 



SUBTOTAL DIRECT COSTS FOR INITIAL BUDGET PERIOD 

CONSORTIUM/CONTRACTUAL COSTS 

DIRECT COSTS 



$85,769 



TOTAL- 



INDIRECT COSTS 



40% Direct 



TOTAL DIRECT COSTS FOR INITIAL BUDGET PERIOD 
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BUDGET FOR ENTIRE PROPOSED PROJECT PERIOD PROJECT 5 

DIRECT COSTS ONLY 



TOTALS 


INITIAL BUDGET 
(from page 4) 


ADDITIONAL YEARS OF SUPPORT REQUESTED 


2nd 


3rd 


4th 


5th 


PERSONNEL 
Salary a fringe oanems 
Applicant organization only 


$64,069 




$69 297 


$72 069 




CONSULTANT COSTS 


$0 


$0 


$0 


$0 


$0 


EQUIPMENT 


$0 


$2,000 


$2,000 


$2,000 


$2,000 


SUPPUES 


$15,500 


$16,120 


$16,765 


$17,436 


$18,133 


TRAVEL 


$1,200 


$1,248 


$1,298 


$1,350 


$1,404 


PATIENT 

CARE 

COSTS 


INPATIENT 


$0 


$0 


$0 


$0 




OUTPATIENT 


$0 


$0 


$0 


$0 


$0 


ALTERATION 
RENOVATION 


IS AND 
IS 




$0 


$0 


$0 


$0 


OTHER EXPENSES 


$5,000 


$5,200 


$5,408 


$5,624 


$5,849 


SUBTOTAL DIRECT COSTS 


$85,769 


$91,200 


$94,768 


$98,479 


$102338 


CONSORTIUM/ 
CONTRACTUAL COSTS 


$34,308 


$35,680 


$37,107 


$38,591 


$40,135 


— TOTAL DIRECT COSTS 




$126,880 






$142,473 


$120,077 


$131,875 


$137,070 


TOTAL DIRECT COSTS FOR ENTIRE PROPOSED PROJECT PERIOD (Item 8a)-> 


$658375 



JUSTIFICATION (Use continuation pages if necessary): 

From Budget for Initial Period: Describe the^pecificJunctions_of _the personnel collaborators, and consultants and identify 
individuals with appointments that areless than fuirtime for a specific period of the year/includirigVA appointments. 



For All Years: Explain and justify purchase of major equipment, unusual supplies requests, patient care costs, alterations and 
renovations, tuition remission, and donor/volunteer costs. 

From Budget for Entire Period: Identify with an asterisk (*) on this page and justify any significant increase or decrease in 
any category over the initial budget period. Describe any change in effort of personnel. 

For Competing Continuation Applications: Justify any significant increases or decreases in any category over the current 
level of support 



INITIAL BUDGET PERIOD: 



General: This budget covers only the expenses in Dr. George Barany's laboratory at the University 
of Minnesota, and represents modest levels by comparison to other ongoing and past grants from NIH. The 
significant intellectual and experimental contributions to the project of Dr. Robert P. Hammer from 
Louisiana State University, and Dr. Francis Barany of Cornell University Medical College, are supported by 
separate budgets in this program project. 

Personnel: The preparation, manipulation, and characterization of oligonucleotide and PNA 
building blocks and oligomers, as well as of a range of solid supports needed for synthesis and 
hybridizations, is quite labor-intensive and requires experienced and well-trained co-workers. Dr. Josef 
V£gner is a highly qualified peptide chemist who has been working in Dr. Barany' s laboratory for over a 
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year on other projects. Ms. Vagnerova holds a degree in biochemistry and has recently arrived in this 
country to join the research group as a technician. These two individuals will be able to make an immediate 
impact to this Research Plan. Salaries follow University of Minnesota guidelines, and fringe benefits are 
calculated at 23.7% academic, 3.6% postdoctoral, and 27.5% civil service. Professors have 9-month 
academic appointments, and need to cover the 3-month summer salary from grants. 

Equipment: Dr. Barany's laboratory has all of the major instrumentation required to carry out this 
research, with several new instruments acquired recently to replace and/or augment older models (listed 
with "Resources and Environment"). 

Supplies and Other Expenses: We are perpetually underfunded in these categories, and require a 
combination of grants to cover these costs. "Supplies" include chemical reagents, consumable supplies, and 
chromatography expenses. "Other Expenses" cover analytical fees (NMR, mass spectrometry, elemental 
analysis), instrument maintenance (service contracts on peptide synthesizers and amino acid analyzer shared 
with other grants), publication costs, communications, etc. 

Travel: Funds are requested to allow attendance at one professional meeting per year in order to 
present results and learn of advances in scientific fields related to this proposal. 

Consortium/Contractual Costs: The University of Minnesota negotiated (May 13, 1992) indirect 
cost rate is 40% of modified direct costs (total direct costs - equipment - graduate student benefits). 

CONTINUATION YEARS: 

Personnel: Dr. Barany's research program attracts a constant stream of postdoctoral applicants from 
good laboratories throughout the world, so there will be no difficulty in appointing individuals to continue 
the work after Dr. V£gner leaves. Similarly there is a good pool of candidates for technician positions. The 
"Research Plan" will require constant staffing at the level of two individuals. 

Equipment : We request $2,000 per year to cover relatively small items of lab hardware. 

Increases : Following NIH guidelines, the percentage recurring annual increase in all costs are 
calculated at 4%. 

Five years of support are requested in order to allow enough time to show significant progress on the 
"goals~of the "Research Plan." 
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RESOURCES AND ENVIRONMENT 



FACILITIES' Mark the facilities to be used at each performance site listed in Item 9. Face Page, and briefly indicate their capacities, pertinent [capabilities, relative 
Droximitv anc I extent of availability to the project. Use 'Other to describe the facilities at any other performance sites listed in Item 9 on the Face Page and at sites for 
field studies. Use continuation pages if necessary. Include an explanation of any consortium/contractual arrangements with other organizations. 



□ 



Laboratory: 



□ Clinical: 



The Barany group currently numbers about a dozen full-time researchers, and two and 
a half 490 ft 2 laboratories (Kolthoff 476, 468, and 470; listed in order of length of 
occupancy; 4 desks per lab; common service corridor). These labs are down the hall 
from the faculty office. A 260 ft 2 laboratory (Kolthoff 463 A, 1 desk) adjoins the 
office and is used for work by Dr. Barany and a laboratory technician and /or 
undergraduate research assistants. The group also occupies a 300 ft 2 instrumentation 
room (Kolthoff 472), and has a 125 ft 2 section of a shared instrumentation room 
(498A). All of this is in the Department of Chemistry on the Minneapolis campus of 
the University of Minnesota. 



Q Animal: 



Q Computer 



Three Macintosh personal computers for word processing and graphics 



□ Office: 



140 ft 2 (Kolthoff 461) 



□ Other (J: 



MAJOR EQUIPMENT: Ust the most important equipment items already available for this project, noting the location and pertinent capabilities of each. 

MilliGen/Biosearch 9050 and 9600 Peptide Synthesizers (acquired 1990); Beckman System 6300 High 
Performance Amino Acid Analyzer (acquired 1989); Beckman-Altex analytical gradient HPLC apparatus 
(acquired 1981) with variable wavelength UV detector and Hewlett-Packard recording integrator; Waters 
Delta -Prep 3000 HPLC (acquired 1988) apparatus with UV detector, integrator, and automatic sample 
injector; Beckman P/ACE 2100 capillary zone electrophoresis system (acquired 1991); MPLC set-up; UV- 
visible spectrophotometer; photolysis equipment; fraction collectors; Labconco lyophilizer; basic organic 
synthesis set-up; excellent hoods in all of the laboratories. 



ADDITIONAL INFORMATION: Provide any other information describing the environment tor the project Identity support services such as consultant, secretarial, 
.-machine shop^andjBlectranics shop, and me extent to whjch.they wjll be available to the project __ _ 

Buildings of the Department of Chemistry contain major instrumental facilities for routine and High-field *H 
and 13 C-NMR, IR, mass spectrometry (including FABMS), and X-ray diffraction which are extensively used 
in this research program. Machine, electronics, and glassblowing shops are on-site, as is a research 
stockroom. The Department offers quarter-time secretarial support to the faculty, and Dr. Barany 
—occasionally gets additional-secretarial help .paid forJby research-grants. „An additional resource for this 
research is the Microchemical Facility of the University of Minnesota Institute of Human Genetics on the 
Minneapolis campus, which includes state-of-the-art equipment for amino acid analysis, peptide sequencing 
and synthesis, oligonucleotide synthesis, and other procedures. 
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A. SPECIFIC AIMS: 

This Research Plan seeks to develop and optimize new tools that will be essential components to the 
integrated approach to cancer, genetic, and infectious disease detection described in the overall program 
project application. A novel polymerase chain reaction/ligase detection reaction (PCR/LDR) method for 
discriminating normal, carrier, and disease individuals has been described, and a high-sensitivity 
PCR/restriction endonuclease/LDR (PCR/RE/LDR) method for detection of rare cancer mutations is under 
development (Projects 1 and 2). These technologies, in their present form or as improved by applications of 
"convertide" nucleotide analogue bases (Project 3) and/or engineered thermostable ligase (Project 4), will be 
carried out in multiplex formats to simultaneously identify many mutations. LDR products, derived from 
one fluorescent primer and an adjacent primer with extra nucleotides or hexaethylene oxide "tails", are 
currently separated by size on an automated DNA sequencer, or by capillary electrophoresis. Use of 
different fluorescent groups allows a second dimension of mutation discrimination. — 

Herein, we propose new solid-phase approaches for simultaneous detection of multiplex LDR 
products. The general idea is that specific products will be captured on a spatially addressable array, so that 
the position of a signal identifies a mutation. Each LDR product will have a "zip code" tail, which will be 
captured selectively by a "complementary zip code" on the solid support. The supported (complementary) 
components can be modified DNA oligonucleotides or peptide nucleotide analogues (PNA), designed so 
that the resultant zip code hybrids have a significantly higher T m than DNA/DNA hybrids. Unreacted 
primers may be washed away at high temperatures, allowing for detection, of the LDR product. Multiple 
reuse of a universal "complementary zip code" array is envisaged to allow detection of a wide range of 
cancers and genetic diseases. 

Implementation of these concepts, with the ultimate goal of achieving reliable and efficient materials 
and procedures that can be incorporated into easy-to-use, automated, low-cost diagnostic devices, will 
follow these aims: 

(i) Development and evaluation of solid support materials compatible with chemical synthesis 
of DNA oligonucleotides"ana~PNA oligomers, and "compatible with sulSsequent" hybridization 
reactions. Both commercially available and experimental materials will be screened. Surfaces, 
beads,~or membranes- wilhbe-functionalizedrand extended as needed with hydrophilic spacers such 
as heterobifunctional polyethylene glycol (PEG) and/or carbohydrates. Chemistry for linking 
oligomers to the solid support, and/or solid-phase assembly of oligomers, will be developed. 

(ii) Establishment of methodology for synthesis of spatially addressable arrays of DNA 
oligonucleotides and PNA oligomers. Appropriate masking technology will expose defined 
regions of the solid support for attachment of pre-formed oligomers, or for chain elongation to 
assemble the needed oligomers. In the latter mode, segment condensation will be used when 
possible in order to provide efficient convergent synthesis, and because chemical "failures" will 
become "invisible" during the subsequent hybridization. 

(iii) Demonstration of scope and limitations of zip code concepts. As aims (i) and (ii) come to 
fruition, testing will be carried out (Core B). Design of primer and zip code structures will be 
facilitated by the informatics collaboration (Core A). 

B. BACKGROUND AND SIGNIFICANCE 

The cancer-detection technology of this program project application relies in good part on the 
capability to prepare by rapid, accurate chemical methods a multitude of oligonucleotide and related 
structures of defined sequence in the 20 to 50-base size range. The revolutionary solid-phase approach, 
introduced by Merrifield in the 1960's for peptides, points the way and provides ample experiences and 
precedents [1-6]. In solid-phase methodology that has been refined substantially and is readily automated, 
suitably protected amino acid building blocks are added in order (C to N) to a growing chain which is 
attached covalently through the C-terminus to a polymeric support. The principal commercially available 
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chemistries are referred to as "Boc"-and "Fmoc", abbreviations for the names of the key temporary 
protecting groups. Often, a linker or handle is used to mediate the initial anchoring. Reactions are driven to 
completion by the use of excess reagents, which are removed by simple filtration and washing steps; the 
chemistry can also be carried out in columns in a continuous-flow mode. Upon completion of chain 
assembly, protecting groups are cleaved and the peptide is released into solution for further purification and 
characterization. An important aspect for successful results is the choice of the polymeric support. For 
many years, most work was carried out on 1% cross -linked microporous polystyrene resins (beaded), or on 
polyacrylamides (these latter could be embedded within an inorganic matrix, e.g., kieselguhr, or a rigid 
macroporous polystyrene, e.g., Polyhipe) [6-12]. Within the past few years, several additional materials 
with interesting physico-chemical properties have become available from several academic laboratories and 
commercial sources. These include membranes [13, 14], cotton and other appropriate carbohydrates [15- 
18], controlled-pore silica glass [19]^ and linear polystyrene grafted onto Kel-F [20]. A particularly 
interesting concept involves the use of polyethylene glycol-polystyrene supports (PEG-PS or Tentagel), 
which are compatible with both batchwise and continuous-flow reactors, and may facilitate difficult 
chemistries in peptide synthesis [21-23] . Other recent trends from the peptide field which are relevant to the 
goals of this proposal involve synthesis on polymeric surfaces, and the simultaneous preparation of multiple 
structures. In these procedures, due to Geysen, Frank, and research groups at Affymax, Arris, and 
Millipore, among others, relatively short peptides are built up on appropriately modified polyethylene pins, 
cellulose or polypropylene membranes, or glass surfaces, in a way that the bound final structures (purities in 
the 60 to 90% range) are tested directly by ELISA or other biological testing methods [15, 24-27]. The 
active structures are then deduced from their physical position, i.e., spatial address on the array. In an 
alternative combinatorial library approach developed at Selectide [28], millions of peptides are generated 
simultaneously by successive cycles of randomization/remixing of beaded supports. Intrinsic to the design 
of these experiments, each individual bead contains only a single peptide, so that those beads giving a 
positive interaction with a receptor can be picked out and subjected to analytical procedures that give the 
structure on the bead. 

Solid-phase oligonucleotide synthesis has come to the fore in the past eight years with the 
development of reliable high-efficiency phosphoramidite [29] or H-phosphonate chemistry [30, 31] for 
linking protected nucleoside building blocks. Synthesis (3' to 5') is usually supported on controlled-pore 
glass, although other materials can also serve. The current automated methodology can routinely furnish 
oligonucleotides of length > 50 residues in overall purity > 90% directly upon release from the support. The 
.methodology can also be adapted to incorporate unusual nucleotide bases, as well as modifications in the 
: phosphodiester backbone (e.g., non-bridging thio or dithio substitution) and end-group labels (e.g., 
fluorescent dyes, biotin) [32]. As with peptides, relatively short oligonucleotides can be synthesized in 
spatially addressable arrays on glass surfaces [33-37] . Alternatively, a variety of procedures have been 
described for site-specific attachment of pre-synthesized oligonucleotide probes to nylon membranes or 
inside polyacrylamide gels [38-40] . Such arrays have been applied for DNA hybridization reactions, with 
applications to DNA sequencing or detection of biotinylated PCR-amplified products. These earlier studies 
provide useful precedents to some of the goals of this proposal. 

Within the last few years, a group from Denmark [41-44] has introduced novel peptide nucleotide 
analogue (PNA) oligomers which mimic closely the spatial arrangement of the oligonucleotide backbone, 
but use nonchiral (2-aminoethyl)glycine units to replace the sugar phosphodiester (Figure 1). Additional 
innovations for PNA chemistry are under development at Millipore, in close consultation and collaboration 
with us [45]; These improvements include complete protection schemes for all the "bases" in concert with 
Boc, Fmoc, or alternative chemistries, optimized coupling (note: since racemization is not an issue, strong 
activation methods can be applied) and capping protocols, and efficient sequencing procedures. PNA and 
~ Tingle -sf^ join to form anti-parallel heteroduplexes that exhibit Watson-Crick specificity and 

(particularly under low-salt conditions) tighter binding (higher T m ) than the corresponding double -stranded 
DNA [44] . As is discussed later, these properties dovetail extremely well with some of the requirements for 
the multiplex cancer detection protocol proposed in this program project application. 
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Fig. 1. Structural similarity of DNA and PNA. 

C. PRELIMINARY STUDIES 

The previous section of this proposal gave a brief overview of the current status of solid-phase 
methodologies, with an emphasis on general aspects which are relevant to the objectives of the research 
program. The following paragraphs provide brief descriptions of recent experimental advances from our 
laboratories which place us at the cutting edge of methodological developments. 

We have invented several procedures for grafting both homo- and heterobifunctional polyethylene 
glycol (PEG) derivatives of defined molecular weight onto amino-functionalized polystyrene (PS); the 
resultant nricroporous beaded PEG^PS supports-are-now commercially available thrpugh_Millipore and have 
numerous advantageous properties with respect to polystyrene [22, 23]. Our extensive experience in this 
field will be needed to introduce PEG as a "spacer" separating oligonucleotide or PNA molecules from 
functionalized surfaces. A useful start in this regard was reported recently by our collaborators Dr. Dereck 
Hudson and Dr. Ronald Cook [27] who in turn derivatized polyethylene plates, modified the resultant 
surfaces with PEG, and coupled carboxymethyl dextran to impart further hydrophilicity and serve as a 
starting point for peptide synthesis. (Please se e letters of c ollaboratio n in overview se ction of program 
project). 

PEG-PS has proven to be an ideal support for peptide library studies by the Selectide process [28] . 
PEG-PS is compatible with the organic reagents and solvents for efficient synthesis throughout the beads, 
and also has sufficient hydrophilic character to allow biological testing in aqueous milieus. Based on the 
realization that biological interactions occur only at the surface of beads, we have devised methodology for 
differentiation of "surface" and "interior" areas. PEG-PS is loaded with Boc-Trp-Gly, following which 
chymotryptic digestion "shaves" selectively only substrates at the surface which are accessible to the 
macromolecular enzyme. The exposed glycine is the starting point for orthogonal peptide synthesis using 
Fmoc chemistry, so that each bead is charged with a "screening" peptide at the surface, representing <1% of 
the total content but responsible for the entire spectrum of biological interaction. In concert, Boc chemistry 
establishes a sequenceable "coding" peptide confined to the interior, representing the vast majority of 
material on the bead but restricted from biological interaction [46] . This "shaving" concept has implications 
beyond the application to encoded combinatorial libraries just described; in the context of the present 
Research Plan, it can be used to ensure that oligopolymers synthesized on surfaces will be able to hybridize 
to oligonucleotides. 

Our laMrMQrLeA hay? also pioneered the development of noveHinkers and handles for peptide 
synthesis [47, 48]; the extension to DNA and PNA is expect^to^stiaighTfb^aia; CJf^arficuIar interest, 
we have developed tris(alkoxy)benzyl amide (PAL) [49] and ester (HAL) [50] linkages, which upon 
cleavage with acid provide respectively C-terminal peptide amides, and protected peptide acids that can be 
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used as building blocks for so-called segment condensation approaches. We have noticed that the stabilized 
carbonium ion generated in acid from cleavage of PAL or HAL linkages can be intercepted by tryptophanyl - 
peptides. While this reaction is a nuisance for peptide synthesis and preventable (impart) by use of 
appropriate scavengers, we envisage herein a positive application to chemically "capture" oligo-Trp-end- 
labelled DNA and PNA molecules by HAL-modified surfaces (see Fig. 7 in Experimental Designs and 
Methods). 



D. EXPERIMENTAL DESIGN AND METHODS 
(i) Overview 

(a) Array technology. This Reseach Plan describes a systematic approach to the design and 
synthesis of oligonucleotide or PNA arrays to achieve accurate detection and quantification of cancer 
mutations. Several groups have attempted to make oligonucleotide arrays with various degrees of success 
[33, 37- 40]. These approaches may be divided into three categories: (i) Synthesis of oligonucleotides by 
standard methods and their attachment one at a time in a spatial array [38-40] (ii) Photolithographic masking 
and photochemical deprotection on a silicon chip, to allow for synthesis of short oligonucleotides [37] , and 
(iii) Physical masking to allow for synthesis of short oligonucleotides by addition of single bases at the 
unmasked areas [33, 36]. Although considerable progress has been made in constructing oligonucleotide 
arrays, some containing as many as 256 independent addresses, severe limitations have been noted in using 
these arrays for detecting specific DNA sequences by hybridizations. Arrays containing longer 
oligonucleotides can currently be synthesized only by attaching one address at a time, and thus are limited in 
potential size. (Current methods for attaching an oligonucleotide take about 1 hour, thus an array of 1,000 
addresses would require over 40 days of around-the-clock work to prepare.) The "reverse dot blot" 
approach is capable of distinguishing single base differences in homozygous or heterozygous individuals, as 
well as the presence of a ras mutation diluted 20-fold by normal DNA [40]. However, hybridization 
methods require careful attention to temperature and salt conditions, and cannot achieve the high sensitivity 
of - th&~ cancer detection methods described inohis proposal. Arrays containing large numbers of short 
oligonucleotides have performed significantly better on the computer than in practice. Syntheses on 
membranes or silicon chips are plagued by less than 100% efficiency, effectively limiting the size of these 
oligonucleotides to 8- to 10-mers. Imperfect hybridizations generate significant-background signals, which 
severely hamper use of these arrays for DNA sequencing [36] . 

(b) Zip code concept. This proposal introduces a novel approach to oligonucleotide arrays which 
should obviate the above problems. One significant difference between our approach and literature array 
methods is that we use the array as a means to capture the correctly generated product. While others try to 
distinguish closely related sequences by subtle differences in melting temperatures during hybridization, we 
have already achieved the required exquisite specificity due to the discriminating actions of thermostable 
ligase in solution. Thus, our arrays can be designed to contain sequences which are very different from 
each other. Our array may be likened to 1,000 different antibodies that bind 1,000 different antigens with 
tight binding constants and no cross-reactivity. These arrays are completely universal, so that a single 
design may be used in detection of infectious and genetic diseases, or cancers. Best of all, the arrays will be 
highly stable and reusable. 

The 1,000 different "antigens" are unique 24-mer "zip code" sequences linked covalently to the 
approximately 20- to 25-mer target-specific sequence of an LDR primer. A "zip code" sequence does not 
have any homology to either the target sequence or to other sequences on the genome, This zip code tail is 
then captured by~its "antibody", a sequence complementaiyto therzip code on the addressable-solid support 
array. The concept is shown in two possible formats for detection of the p53 R248 mutation (Fig. 2). At the 
top of the diagram shows two alternative formats for primer design to identify the presence of a germ line 
mutation in codon 248 of the p53 tumor suppression gene. The wild type sequence codes for arginine 
(R248) while the cancer mutation codes for tryptophan (R248W). The lower part of the diagram is a 
schematic for zip code capture. In the first format (A), the discriminating primers contain the allelic specific 
base (T for mutant and C for wild type) on the 3' end and 24-mer zip codes Zl and Z2 on their 5* ends 
respectively. A common downstream primer contains a fluorescent group Fl at its 3' end. In the presence 
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presence of appropriate target DNA (wild type DNA is shown), the correct ligation products form. After 
hybridization of the zip code primers to their complementary sequences on the addressable array, unreacted 
fluorescent primers will be washed away. Mutant and wild type signal may be quantified using a 
Fluorlmager, and distinguished by their position on the array. In an alternative format (B), the 
discriminating oligonucleotides contain two different fluorescent groups Fl and F2, while the common 
oligonucleotide contains the zip code Zl. In this format, mutant and wild type signal are distinguished by 
the differences in fluorescence between Fl and F2 (see legend of Fig. 2 for more details). 
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Fig. 2. Two alternative formats for zip code capture (see following page for legend). 
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- Two alternative formats for zip code capture -(legend for previous page. The top portion of the diagram shows two alternative 
formats for primer design to identify the presence of a germ line mutation in codon 248 of the p53 tumor suppressor gene. The 
wild type sequence codes for arginine (R248), while the cancer mutation codes for tryptophan (R248W). The bottom part of the 
diagram is a schematic diagram of zip code capture. The thick horizontal line depicts the membrane or solid surface containing 
the addressable array. The thin curved lines indicate a flexible linker arm. The thicker lines indicate a PNA sequence, attached to 
the solid surface in the C to N direction. For illustrative purposes, the PNA oligonucleotides are drawn vertically, making the 
linker arm in section B appear "stretched". Since the arm is flexible, the oligonucleotide will be able to hybridize 5* to C and 3* to 
N in each case, as dictated by base pair complementarity. A similar orientation of DNA/PNA hybridization would be allowed if 
the PNA were attached to the membrane at the N-terminus. Similar considerations apply when the complementary zip code on 
the support is a DNA oligonucleotide rather than PNA. (A) Two LDR primers are designed to discriminate wild type and mutant 
p53 by containing the discriminating base C or T at the 3' end. In the presence of the correct target DNA and Tth ligase, the 
discriminating primer is covalently attached to a common downstream oligonucleotide. The downstream oligonucleotide is 
fluorescently labeled. The discriminating oligonucleotides are distinguished by the presence of a unique "zip code" sequences, Zl 
and Z2, at each of their 5' ends. A black dot indicates th'at target dependent ligation has taken place. "After ligation, all zip code 
primers may be captured by their complementary "zip code" sequences at unique addresses on the array. Both ligated and 
unreacted primers are captured by the PNA an-ay. Unreacted fluorescently labeled common primers and target DNA are then 
washed away at a high temperature (approximately 65°C to 80°C) and low salt. Mutant signal is distinguished by detection of 
fluorescent signal at the 21 position, while wild type signal appears at the Z2 position. Heterozygosity is indicated by equal 
signals at both Zl and Z2. The signals may be quantified using a Molecular Dynamics Fluorlmager. This format uses a unique 
address for each allele, and may be preferred for achieving very accurate detection of low levels of signal (30 to 100 attomoles of 
LDR product). (B) In this format, the discriminating oligonucleotides are distinguished by having different fluorescent groups, Fl 
and F2, on their 5' end. Either oligonucleotide may be ligated to a common downstream oligonucleotide containing a zip code 
sequence Zl "on its 3' end. In this format, both wild-type and mutant LDR products are captured at the same address on the array, 
and are distinguished by their different fluorescence. This format allows for a more efficient use of the array and may be 
preferred when trying to detect hundreds of potential germline mutations. 

(c) Design and synthesis of arrays. This Research Plan will explore variations of two general 
approaches for synthesizing arrays. In the first approach, we will prepare full-length 24-mer DNA 
oligonucleotides or PNA oligomers, which are subsequently linked covalently to a solid support or 
membrane. Alternatively, the deprotected DNA or PNA may remain linked to the bead, and the entire bead 
..glued to a solid support. In the second approach, 36 specially designed PNA tetramers will be synthesized. 
These tetramers will be added to specific rows or columns on a solid support or membrane surface. The 
resulting "checkerboard" pattern will generate unique addressable arrays of PNA 24-mers. 

We will initially explore glass and derivatized membrane supports to test their sensitivities and 
capacities as array surfaces. Pilot experiments will involve synthesis of five zip code PNA oligomers or 
oligonucleotides (sequences listed in Table 2, later). These oligomers will be covalently linked to the test 
surfaces. Fluorescently labeled complementary DNA zip code sequences will be synthesized in Core B, and 
used for testing arrays produced in this project. Note that for the initial studies, we do not require the longer 
conjugates that combine the LDR primer with the zip code (see Fig. 1). 

What properties are desired in an array? The most important factor is good loading of 
oligonucleotide or PNA oligomer in a relatively small, but well-defined area. The current commercially 
available fluorescent imager can detect a signal as low as 2 attomoles per 50|i square pixel. Thus, a 
reasonable size address or "spot" on an array would be about 4x4 pixels, or 200|X square. The limit of 
detection for such an address would be about 32 attomoles per "spot", which is comparable to the 100 
attomole detection limit using a DNA sequencing machine. The capacity of oligonucleotide which can be 
loaded per 200|i square will give an indication of the potential signal to noise ratio. A loading of 20 fmoles 
would give a signal to noise ratio of 625 to 1, while 200 fmoles would allow for a superb signal to noise 
ratio of 6,250 to 1. Loadings in excess of 200 fmoles will be unnecessary, since most LDR reactions use 
only 200 fmoles of each primer. The oligonucleotides P-NA-oligomer-should -be on-a flexible- linker arm" 
and on the "outside" or "surface", of the solid support for easier hybridizations. The support should be non- 
fluorescent, and should not interfere with hybridization nor give a high background signal due to nonspecific 
binding. In a mode where bead(s) are attached (typical size 50-200|i), neither beads nor "glue" should give 
a high background signal due to nonspecific binding or intrinsic fluorescence. 

This proposal also introduces a novel approach for the design and synthesis of a universal PNA 
oligonucleotide array with 1,296 addresses. We envision each address to be about 200|i with an equal size 
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space in between addresses. Feasibility will be assessed with a 25 address array that is about 2 mm square; 
tie full-sized array would be about 1 .4 cm square. Preparation of such arrays (pilot and full-sized) will be 
carried out as a joint project with our industrial collaborators Dr. James Coull and his team at Millipore, and 
Dr Ronald Cook who heads Siris Labs. (Please see letters of collaboration in overview section). The 
required preliminary synthesis in the academic laboratories will use a Biorad dot blot apparatus which 
contains individual microtiter wells sandwiched around a membrane. This allows for addition and filtration 
of chemicals in each well. 

(ii) Design and optimization of zip codes and addresses 

(a) General considerations. The principle of using zip codes has been explained earlier. The 
complementary zip codes (addresses) on the solid supports can be either DNA or PNA. Both will be tested. 
However, we expect that PNA-based capture of zip codes may have advantages over DNA-based capture 
because PNA/DNA duplexes are much stronger than DNA/DNA duplexes, by about l°C/base-pair [44]. 
Thus, for a 24-mer DNA/DNA duplex with T m = 72°C, the corresponding duplex with one PNA strand 
would have a "predicted" T m = 96°C (the actual melting point might be slightly lower as the above "rule of 
thumb" is less accurate as melting points get over 80°C). Additionally, the melting difference between 
DNA/DNA and PNA/DNA becomes even more striking at low salt. 

(b) Enhancement of the hybridization affinity of zip code/address duplexes. The melting temperature 
of DNA/DNA duplexes can be estimated as [4n(G»C) + 2m(A«T)]°C. If possible, we would like to narrow 
the T m difference between zip code duplexes resulting from differences in G»C/A»T content, and in this way 
further optimize zip code capture. Froehler has shown that use of 5-propynyl-dU in place of thymine 
increases the T m of DNA duplexes an average of 1.7°C per substitution [51]. We suggest that the same 
substitution in the zip code capture scheme would lower the T m difference between zip code/address 
duplexes, and raise the T m for all of the zip code/address duplexes. Phosphoramidite derivatives of 
5-propynyl-dU (Fig. 3) will be prepared according to Froehler [51]. The 5 -propynyluracil PNA monomer 
with Fmoc amino protection will be made (Fig. 4) following the published synthesis of PNA monomers [41, 
42], replacing thymine with 5-iodouracil and using Pd(0) coupling of the alkylated 5^iodouracil and 
prppyne. These monomers will be incorporated into synthetic DNA and PNA strands, respectively, and 
evaluated as described later. 




Fig. 3. Structure of nucleoside analogue 5-propynyl-dU. 
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The Boc-protected derivative could be made by a similar route. 
Fig, 4. Synthesis of Fmoc-protected 5-propynyl-uridine PNA monomer. 



(c) Zip code sequences designed from tetramer building blocks. Of the 256 (4 4 ) possible ways in 
which four bases can be arranged as tetramers, we have selected 36 that have unique sequences (Fig, 5). 
Each of the chosen tetramers differs from all the others by at least two bases, and no two dimers are 
complementary to each other. Furthermore, tetramers that would result in self-pairing or hairpin formation 
of the addresses have been eliminated (see legend to Fig. 5 for further details of the design process). 

The final tetramers are listed in Table 1, and have been numbered arbitrarily from 1 to 36. Our 
premise is to use this unique set of tetramers as design modules for the required 24-mer zip code and 24-mer 
address sequences. The structures can be assembled by stepwise (one base at a time) or convergent 
(tetramer building blocks) synthetic strategies. Note that the numbering scheme for tetramers allows us to 
abbreviate each zip code as a string of six numbers (e.g., second column of Table 2, in following section). 

id) Initial zip code test sequences. The concept of zip code 24-mers designed from a unique set of 
36 tetramers (Table 1) allows a huge number of possible structures, 36 6 = 2,176,782,336. We have chosen 
=Tive structures (Table 2) that have nearly equal G + C content as targets for the graded-set-of studies that are 
needed to establish the proposed methodology. 

(e) Solution studies of zip code annealing. Our eventual goal is to exploit zip code hybridization to 
— direct fluorescently labelled LDR products towards specific addresses on-a solid support- However-,- first we 
wish r to validate duplex formation in solution. Test sequences (Table 2) will be used. The T m of each 
"duplex will be measured by recording the A"266 "Of the" oligonucleotide solutions (^ 5^iM concentration of 
each single strand) versus temperature. 

Synthetic probes (normal and complementary directions) for the aforementioned studies will be 
prepared as either DNA or PNA, with either all thymine or all 5-propynyl-uracil. Where syntheses are 
straightforward, they will be performed by Core B, but where methodology is still under development, 
syntheses will be performed in the laboratories of program project chemists or industrial collaborators. 
These syntheses will generate for each sequence a total of eight oligomers, which can be combined in 16 
ways that form duplexes. 
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Fig. 5. Design of 36 tetramers which differ from each other by at least 2 bases. Checkerboard pattern shows all 256 possible 
tetramers. A given square represents the first two bases on the left followed by the two bases on the top of the checkerboard. 
Each tetramer must differ from each other by at least two bases, and should be non-complementary. The tetramers are shown in 
the white boxes, while their complements are listed as (number)'. Thus, the complementary sequences GACC (20) and GGTC 
(20') are mutually exclusive in this scheme. In addition, tetramers must be non-palindromic, e.g., TCGA (darker diagonal line 
boxes), and non-repetitive, e.g., CACA (darker diagonal line boxes from upper left to lower right). All other sequences which 
differ from the 36 tetramers by only 1 base are shaded in light gray. Four potential tetramers were not chosen as they are either all 
A«T or G*C bases. In addition, thymine can be replaced by 5-propynyl uridine when used within DNA or PNA address sequences 
as well as in the DNA zip code sequences. This would increase the T m of an A # T base pair by -1.7°C. Thus, T m values of 
individual tetramers should be approximately 15.1°C to 15.7°C. T m values for the full length 24-mers should be 95°C or higher. 
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Table 1. List of tetramer PNA sequences and complementary DNA sequences, which differ from each 
other by at least 2 bases. 



Number 


Sequence (N-C) 


Complement (5-3*) 


G + C 


1. 


TCTG 


CAGA 


2 


2. 


TGTC 


GACA 


2 


3. 


TCCC 


GGGA 


3 


4. 


TGCG 


CGCA 


3 


5. 


TCGT 


ACGA 


2 


6. 


TTGA 


TCAA 


1 


7. 


TGAT 


ATCA 


1 


. 8. 


„ TTAG 


CTAA 


1 


9. 


CTTG 


CAAG 


2 


10. 


CGTT 


AACG 


2 


11. 


CTCA 


TGAG 


2 


12. 


CACG 


CGTG 


3 


13. 


CTGT 


ACAG 


2 


14. 


CAGC 


GCTG 


3 


15. 


CCAT 


ATGG 


2 


16. 


CGAA 


TTCG 


2 


17. 


GCTT 


AAGC 


2 


18. 


GGTA 


TACC 


2 


19. 


GTCT 


AGAC 


2 


20. 


GACC 


GGTC 


3 


21. 


GAGT 


ACTC 


2 


22. 


GTGC 


GCAC 


3 


23. 


GCAA 


TTGC 


2 


24. 


GGAC 


GTCC 


3 


25 


AGTG . _ 


. __ _ CACT 


2 


26. 


AATC 


GATT 


1 


27. 


ACCT 


AGGT 


2 


28. 


ATCG 


CGAT 


2 


29. 


ACGG 


CCGT 


3 


30. 


AGGA 


TCCT 


2 


31. 


ATAC 


GTAT 


1 


32. 


AAAG 


crrr 


1 


33. 


CCTA 


TAGG 


2 


34. 


GATG 


CATC 


2 


35. 


AGCC 


GGCT 


3 


36. 


TACA 


TGTA 


1 



(iii) Solid support materials for array technology 

Earlier sections of this proposal have reviewed the ideal requirements for array support materials, in 
the context of options from the literature and our own extensive experiences. The solid supports must be 
charged with DNA oligonucleotides or PNA oligomers; this is achieved either by attachment of 
pre-synthesized probes, or by direct assembly and side-chain deprotection (without release of the oligomer) 
onto the support. Further, the support environment needs to be such as to allow efficient hybridization. 
Towards this end, three factors may be identified: (i) sufficient hydrophilic character of support material 
(e.g., PEG or carbohydrate moieties); (ii) flexible linker arms (e.g., hexaethylene oxide or longer PEG 
chains) separating the probe from the support backbone; (iii) "shaving" procedures which allow probe 
immobilization or probe synthesis to occur only in the most accessible "surface" areas of the support. It 
should be kept in mind that numerous ostensibly "flat surfaces" are quite thick at the molecular level. 
Lastly, it is important that the support material not provide significant background signal due to non-specific 
binding or intrinsic fluorescence. 
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Table 2. List of initial PNA zip code and complementary DNA oligonucleotides. 



Polvmer 


Zip code 


Sequence 


G+C 


PNA 


16-3-34-2 


-9-1 


NH 2 - 


16 3 34 2 9 1 
CGAA - TCCC - GATG - TGTC - C TTG - TCTG 


-COOH 


13 . 


DNA 


1-9-2-34- 


3-16(c) 


5'- 


CAGA- CAAG- GACA-CATC - GGGA-TTCG 


-3' 


13 


PNA 


7-3-11-2- 


18-1 


NH 2 - 


7 3 11 2 18 1 
TGAT- TCCC -CTCA-TGTC-GGTA- TCTG 


-COOH 


12 


DNA 


1-18-2-11 


-3-7 (c) 


5'- 


CAGA-TACC-GACA-TGAG-GGGA-ATCA 


-3 ' 


12 


PNA 


20-3-14-2 


-7-1 


NH 2 " 


20 3 14 2 7 1 
GACC - TCCC - CAGC - TGTC - TGAT- TCTG 


-COOH 


14 


DNA 


1-7-2-14- 


3-20{c) 


5'- 


CAGA - ATC A - G AC A - GC TG - GGGA - GGTC 


-3 ' 


14 


DMA 


29-3-23-2 


-12-1 


NH 2 ~ 


- 

29 3 23 2 12 1 
ACGG - TC CC -GCAA- TGTC -CACG- TCTG 




-COOH 


15 


DNA 


1-12-2-23 


-3-29(c) 


5'- 


C AGA- CGTG - GAC A- TTGC - GGGA - CCGT 


-3' 


15 


PNA 


13-35-27- 


33-2-7 


NH 2 - 


13 35 27 33 2 7 
CTGT-AGCC-ACCT-CCTA- TGTC -TGAT 


-COOH 


12 


DNA 


7-2-33-27 


-35-13 (c) 


5'- 


ATC A - GAC A - TAGG - AGGT - GGCT - AC AG 


-3 ' 


12 



A variety of materials, which include suitably modified glass, plastic, or cellulose surfaces, PEG-PS 
beads, or a variety of membranes, will be examined in the context of the needs summarized above. These 
materials will be obtained from commercial sourc es or from our industrial collaborators (Dr. James Coull at 
Millipore, Dr. Ronald Cook at Siris), or else will be prepared in our laboratories by following literature 
precedents. Depending on the material, surface functional groups (i.e., hydroxyl, carboxyl, amino) may be 
present from the outset (perhaps as part of the coating polymer), or will require a separate procedure (e.g., 
plasma animation, chromic acid oxidation, treatment with a side-chain functionalized alkyltrichlorosilane) 
for introduction of the functional group. Hydroxyl groups become incorporated into stable carbamate 
(urethane) linkages by several methods. Amino functions can be acylated directly, whereas carboxyl groups 
are activated, e.g., with N,N'-carbonyldiimidazole or water-soluble carbodiimides, and reacted with an 
amino-functionalized compound (Fig. 6). Unreacted amino groups will be blocked by acetylation or 
succinylation, to ensure a neutral or negatively charged environment that "repels" excess unhybridized 
DNA. Loading levels will be determined by standard analytical methods [47] . 

Often, it will be desirable to introduce a PEG spacer with complementary functionalization, prior to 
attachment of the starting linker for DNA or PNA synthesis. The methodology to do so is in hand [22, 23, 
52] and will be pursued alongside with control experiments on the same materials lacking PEG. Similarly, 
dextran layers can be introduced as needed by precedented chemistries [27, 53]. Finally, enzymatic 
"shaving" is carried out readily by our recently developed procedure using chymotrypsin to cleave a short 
substrate that is distributed uniformly throughout a bead or on a derivatized surface. In our studies on 
peptide/receptor (antibody or binding protein) interactions, we h ave shown that shavingjprotocols expose a 
I relatively small portion ^ (approximately 1 to 5%) of The total functional groups, yeftffey reach alllreceptor- 
accessible sites [46]. We plan to establish whether the same site selectivity can be achieved for 
hybridization reactions, and compare the results to controls run on "unshaven" materials. 
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X* 



Y-PEG 



(Y)-PEG-Y* 



-0(C=O)Z 
-0(C=S)Z 
-C0 2 H 
-<C=0)Z 
-NH 2 
-N=C=0 

W = protecting group, e.g. Boc, Fmoc 

Z = activating group, e.g. imidazole (Im), p-nitrophenol (OPnp), 

hydroxysuccinimide (OSu), pentafluorophenol (OPfp) 
PEG = oligo or polyethylene glycol), backbone (CH 2 CH 2 OX, n = 6 to 200 

(can also be grown by anionic polymerization with ^7 ) 
WSC = water soluble carbodiimide 



X,Y = -OH X*. Y* = 

-C0 2 H 

-NH 2 
(Y) = W-NH- 



Functional group transformations/activation (as needed), X -»• X*, Y Y* 

-OH — > -O(CH2)„00 2 H n = 1, 2 

-OH — ► -0(C=0)NHCH 2 C0 2 H 

-OH — > -0(C=0)CH 2 NH 2 

-OH — > -0(C=0)Im 

-OH — y -0(C=S)SCH 2 (C=0)NH 2 

-C0 2 H — y -(C=0)NH(CH 2 )„NH 2 n= 2,6 

-C0 2 H-> -<C=0)Z 

-NH 2 — > -NH(G=0)(GH 2 ) n C0 2 H - n = 2, 3 



Covalent linkage, X* + Y* 

-C0 2 H + H 2 N- + WSC + HOSu — > ^(C=iO)NH- 
-OH + Im(C=0)Im + H 2 N- — ► -0(C=0)NH- 

ci -o ~ -O 

>=N >=N >=N 

-OH+ N />-a — >"N ,>-Cl +H 2 N-— >-N />-NH- 

ci a a 

-OH + 0=C=N- — >• -O(C=0)NH- 
-0(C=S)SCH 2 (C=0)NH 2 + H 2 N- -> -0(C=S)NH- 
-OH + ClCH 2 — ^7 +H0- —> -OCH 2 CH(0H)CH 2 O- 

O (+ H 2 NH-) (NH-) 



-OH — > -OCH 2 (C=0)H + H 2 N- + NaCNBH 3 — ► -OCH 2 CH 2 NH- 



Fig. 6. Chemical reactions for covalent modifications, grafting, and oligomer attachments to solid supports. The solid supports 
can be beads, membranes, or surfaces, with a starting functional group X. Functional group transformations can be carried out in 
a variety of ways (as needed) to provide group X* which represents one partner in the covalent linkage with group Y*. The 
Figure shows specifically the grafting of PEG, but the same repertoire of reactions can be used (however needed) to attach 
carbohydrates^with hydrbxyl), Hrifere"(wifJicarb6xyi), and/or DNA oligonucleotides and PNA oligomers that have been extended 
by suitable functional groups (amino or carboxyl). In some cases, group X* or Y* is pre-activated (isolable species from a 
separate reaction); alternatively, activation occurs in situ. Referring to PEG as drawn in the Figure, Y and Y* can be the same 
(homobifunctional) or different (heterobifunctional); in the latter case, (Y) can be protected for further control of the chemistry. 
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HS |Support| 



pH 8 



JrobJ)— C- CHi-S— fSupportl 



(B) 



Ac-Cys<Probe) + 



N— (CH 2 )nC- | Support 



CH3C-N-CH- 
H ' 

CH 2 o 




o 
11 



N—(CH 2 ) n C- | Support} 



O 



-(C) 



ch 3 c 

rob|> + HOCH-?— ^^-QCH 2 )4 C HSupporj 
CH3O 

dilute acid 




O 



1 CH 3 Ov - 
N CH 2 OCH 2 ) 4 C H Support 

CH3O 



Fig. 7. Proposed chemistries for covalent attachment of DNA or PNA probes to solid supports. Chemically synthesized probes 
can be extended, on either end (shown here on the N-terminal of PNA or the 5' end of DNA). Further variations of the proposed 
chemistries are readily envisaged. (A) An amino group on the probe is modified by bromoacetic anhydride; the bromoacetyl 
Function ris'capture3~by a thiorgrbup on the support (B) An N-acetyl, S-tritylcysteine residue coupled to the end of the probe 
provides, after cleavage and deprotecuon, a free thiol which is captured by a maleimido group on the support (C) The probe 
contains an oligo-tryptophanyl tail (n = 1 to 3), which is captured after treatment of a HAL-modified support with dilute acid. 
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(iv) Immobilization of individually synthesized DNA oligonucleotides or PNA oligomers onto solid 
supports ^ 

(a) Synthesis of24-mer DNA oligonucleotides and 24-mer PNA oligomers. The five sequences listed 
in Table 2 are designed for testing the hybridization properties of zip code arrays. They will be synthesized 
by Core B as DNA oligonucleotides using standard phosphoramidite chemistries [29], and incorporating an 
"aminolink" group at the S'-terminus. In addition, the same sequences will be synthesized as PNA 
oligomers by stepwise Boc or Fmoc solid-phase chemistry, or by a segment condensation approach using 
suitably protected PNA tetramers. The PNA will have one endgroup blocked (e.g., acetyl on N-terminus, or 
amide on C-terminus), and the other terminus extended with e-aminocaproic acid to provide a free aliphatic 
amino or carboxyl site for ultimate linking to the solid support. The complementary zip code 
oligonucleotide sequences will be prepared with a fluorescent Tarn" group at the 5-end, by Core B. 

Synthetic DNA oligonucleotides or PNA oligomers will be released from the resin supports, 
concurrent with removal of side-chain protecting groups. These modified oligomers will be purified to 
homogeneity by well-precedented polyacrylamide gel electrophoresis (PAGE) or high performance liquid 
chromatography (HPLC: reversed-phase or anion-exchange) procedures. Immobilization to solid supports 
will follow, as described below. 

(b) Covalent attachment of DNA oligonucleotides or PNA oligomers to solid supports. The purified 
oligomers all contain a free aliphatic amino group at the terminus, which allows attachment to a derivatized 
membrane according to Zhang [40], Other attachment chemistries based on amino group chemistry will 
also be pursued, building on a wealth of precedents for connecting functionalized polymers and proteins to 
each other and to solid matrices [54] (Fig. 6). These procedure can be carried out in series with several 
probes, resulting in site-specific attachments. Once the complementary zip code probes have been 
immobilized, oligonucleotide hybridizations using fluorescently labeled zip codes will be carried out to 
evaluate both capacity and signal to noise ratio, as described in Core B. 

Encouraging results in the experiments outlined above would provide impetus to the exploration of 
additional immobilization ("capture") chemistries, which need to be rapid, specific, and non-destructive to 
the combination of functional groups found in DNA oligonucleotides and PNA oligomers. Our strategy 
involves incorporation, through synthesis, of alternative functional groups at either end of the probe, 
together with modification of the support by a suitable complementary functional group. More specifically, 
we can take advantage of the facile S~-alkylation or Michael addition of thiol groups, or of the reaction in 
dilute acid ofindole moieties with tris(alkoxy)benzyl carbocations (Fig. 7). 

(v) Synthesis of oligonucleotides or PNA oligomers on solid supports and creation of arrays on solid 
surfaces 

(a) Synthesis of 24-mer DNA oligonucleotides and 24-mer PNA oligomers on "shaved" beads. A 
second approach to constructing the arrays required for zip code capture starts with the assumption that 
suitable probes can be assembled and side-chain deprotected with covalent retention on beads used for solid- 
phase synthesis; these beads are then delivered to discrete addresses on a solid surface. General 
considerations have been outlined earlier; we consider this mode to represent a particularly pertinent case 
where application of our "shaving" concept may be critical to success. As before, the five sequences listed 
in Table 2 will be synthesized, either as DNA oligonucleotides (standard phosphoramidite chemistry) or as 
PNA oligomers (stepwise Boc or Fmoc chemistry). 

PEG-PS beads of lOOp, diameter have a normal capacity of approximately 30 pmol, meaning that a 
shaved bead is predicted to hold about 0.1 to 0.5 pmol of final product. This level of material is well within 
the requirements of the subsequent hybridization studies. Chymotryptic "shaving" of a Boc-Trp-Gly-PEG- 
PS sequence generates a free a-amino group from "surface" glycine residues. For PNA synthesis, the 
C-terminal monomer is coupled to form a non-cleavable peptide bond; for DNA synthesis, an N-acetyl- 
~senne~spacer is introduced sothat^hraphoxaimdite- synthe^^ For 
DNA chains upon completion of chain assembly, removal of the usual base-labile side -chain and phosphate 
protecting groups with aqueous ammonia yields the free probe oligomer linked covalently to the outside 
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areas of PEG -PS. -For PNA oligomers, ienzyJoxycarbonyl-type protecting groups will be removed with 
strong acid, e.g., trifluoromethanesulfonic acid (the strategy may change if/when milder protection schemes 
under development at Millipore or by us are established). 

In order to better document the chemistry of chain assembly on "shaved" as well as control beads, the 
synthesis outlined above can be modified by introduction of base-stable, orthogonally cleavable linkers to 
separate the free glycine from the 3' or C-terminal residue. Suitable choices (Fig. 8) include acid-labile 
/>-alkoxybenzyl (PAB), photolabile o-nitrobenzyl (ONb), or Pd(0)-cleavable allyl (Al) [47, 48]. The 
oligonucleotide or PNA products can be released from the support in a discrete step, and evaluated by 
standard analytical criteria as well as solution hybridization with the complementary sequences. 



HO- CH 2 OCH2C-NHCH2C 1 solid surface 



? 



PAB 



0 2 N V 



HO- CH 2 C— NHCH^S solid surface 



ONb 



O 



O 

II 



HO- CH 2 - CH= CH- CH 2 - OCH 2 C-NHCH£ solid surface 



- Al 

Fig 8. Handles, for attachment of oligopolymers to "shaved" beads. Handles are coupled through their carboxyl groups to 
"shaved" beads. The free hydroxyl on the left side of each structure can be esterified with the C-terminal PNA monomer or 
phosphitylated with a nucleoside phosphoramidite. 

(b) Attachment of beads to solid surfaces. DNA or PNA synthesis on PEG-PS using automated 
instrumentation is generally carried out with several hundred thousand to several million beads, setting the 
stage for economical mass production of arrays. Each 200(1 square pixel will contain several 100*1 beads, 
and different spatial addresses will contain different beads. Several of our industrial collaborators are 
actively pursuing concepts for attaching beads to solid surfaces in a rapid and automated manner. (Please 
see letters of collaboration from Dr. James Coull of Millipore and Dr. Ronald Cook of Siris Labs). Briefly, 
these concepts include precise positioning of the bead over a gridded surface and melting one surface to 
attach them, using grooves or dimples in the surface to help position beads (using vacuum suction to guide a 
bead into a particular position), and/or projecting the beads onto a surface containing glue. For academic 
purposes, we will place beads onto a thin layer of some bonding material, such as epoxy. All bonding 
materials will need to be tested for resistance to high temperatures/high salt conditions, and to confirm the 
absence of non-specific binding to DNA oligonucleotides. Oligonucleotide hybridizations using 
fluorescently labeled complementary zip codes will be used to evaluate both capacity and signal to noise 
ratio as described in Core B . 



(c) Variations. As necessary, the evolving literature methods for simultaneous syntheses of peptides 
or oligonucleotides at defined positions will be adapted in concert with the chemistry (linkers, protection 
strategies) and other concepts (shaving) presented above for beads. For example, it will be of interest to 
learn whether shaving of membranes, PEG-modified polyethylene surfaces, or pins helps with synthesis 
"and/or hybridization efficiency. The varlduslitefature protocols Tor multiple m syathe^5~&re quite* labor- 
intensive, but they may need to be pursued should we be unable to devise successful way to glue beads that 
contain DNA or PNA probes to surfaces, and to apply such materials for hybridization. 
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(vi) Direct synthesis of PNA arrays by masking/segment condensation on solid supports 

(a) Perspective. In principle, arrays can be constructed most effectively by use of highly accurate 
masking and unmasking technology with per cycle yields of close to 100%. Unfortunately, current 
chemistries proceed in at best 97% yield per step, with a possible further drop-off as chain length increases. 
These relatively low efficiencies allow for construction of modest arrays in the octamer to decamer size 
range (even so, with substantial synthetic error rates which translate to hybridization at false addresses), but 
preclude construction of 24-mers needed as complementary zip codes in the cancer detection scheme of this 
program project proposal. 

In the following, we propose a novel way to circumvent the aforementioned problems. The design of 
zip code arrays (in which individual arrays have substantial differences to minimize any chances of cross - 
reactivity; see earlier Fig. 5 and Table 1, and accompanying discussion) has been integrated with the 
synthetic strategy. Rather than carrying out stepwise synthesis to introduce bases one at a time, we use 
protected PNA tetramers as building blocks. (As shown below, these are easy to prepare; the corresponding 
protected oligonucleotide intermediates would require additional protection of the internucleotide phosphate 
linkages.) Construction of the 24-mer at any given address requires only six synthetic steps, with a likely 
improvement in overall yield by comparison to stepwise synthesis. Moreover, since failure sequences at 
each address are shorter and lacking at least four bases, there is no risk that these will interfere with correct 
hybridization or lead to incorrect hybridizations. This insight also means that "capping" steps will not be 
necessary. 

Masking technology will allow several addresses to be built up simultaneously, as is explained below. 
As direct consequences of the manufacturing process for the arrays, several further advantages are noted. 
Each 24-mer address differs from its nearest 24-mer neighbor by three tetramers, or at least 6 bases. At low 
salt, each base mismatch in PNA/DNA hybrids decreases the melting temperature by 8°C. Thus, the T m for 
thecorrect PNA/DNA hybridization is at least 48°C higher than any incorrect hybridization. Also, 
neighboring .24-mers are separated by 12-mers, which do not hybridize with anything and represent "dead" 
zones in the cancer detection profile. Finally, by choosing PNA addresses, we create rugged, reusable 
arrays. 

The remaining description indicates methods for preparation of 36 unique PNA tetramers, and shows 
the mechanical/chemical strategy to prepare the arrays. Pilot experiments will result in the creation of a 5x5 
array with 25 addresses of PNA 24-mers. Ultimately, all 36 tetramers can be incorporated to generate full- 
size arrays of 1,296 addresses. 

(b) Synthesis of protected PNA tetramer building blocks. For each of the 36 unique sequences that 
have been designed (Table 1), we require the intermediates with appropriate protection on the a -amino 
group, on the side-chains, and with a free a-carboxyl group. This can be done readily by Fmoc chemistry 
on PAB or HAL resins, according to standard protocols published from our laboratory [50, 55]. Following 
cleavage in dilute acid, the protected intermediates will be purified by chromatography. Alternatively, we 
can use Boc chemistry with ONb (photolabile) or allyl (cleaved by Pd(0)) resins [47, 48, 56]. 

(c) Construction of PNA arrays. As stated already, only the pilot study with a 5x5 array is described. 
Considerations concerning the solid support are the same as described earlier: starting surfaces will contain 
free amino groups ("shaved" if necessary), a non-cleavable amide linkage will connect the C-terminus of 
PNA to the support, and orthogonal side-chain deprotection must be carried out upon completion of segment 
condensation assembly in a way that PNA chains are retained at their addresses. A simple masking device 
has been designed that contains 200|i spaces and 200^1 barriers, to allow each of 5 tetramers to couple to the 
solid support in distinct rows (Fig. 7). After addition of the first set of tetramers, the masking device is 
rotated 90°, and a second set of 5 tetramers are added. This can be compared to putting icing on a cake as 
rows, followed by icing as columns. The intersections between the rows and columns will contain more 
icing*, likewise, each intersection will contain an octamer of unique sequence. Repeating this procedure for 
a total of 6 cycles generates 25 squares containing unique 24-mers, and the remaining squares containing 
common 12-mers (Fig. 10). 
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Addition of different PNA 
tetramers to each reaction 
chamber. Cycles 1, 3, and S. 



Addition of different PNA 
tetramers to each reaction 
chamber. Cycles 2, 4, and 6, 




Fig. 9 . Process for manufacturing an array (see following two pages for Fig. 10 and legends). 
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Fig. 10. Schematic cross -sectional view of synthesis of addressable array (see next page for legend). 
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Fig. 9. Process for manufacturing an array (legend). (A ). Side view of reaction chambers. (B) Three-dimensional view of 
reaction chambers. Each wall and spacer is 100|i thick. These spacers form chambers of width 100^. The multi-chamber device 
is pressed onto the membrane or activated solid surface, forming tight seals. The barriers may be coated with rubber or another 
material to avoid cross contamination from one chamber to the next. One must also make sure the membrane or solid support 
surface is properly wetted by the solvents. The membrane can be in a vertical position with a plate on each side to clamp the 
multi-chamber device to the membrane. Solvents are introduced at the bottom, rise up the chamber, and are removed from the 
top, much like an ABI four channel DNA synthesizer, except now there are 36 chambers and 36 different tetramer bottles. One 
proceeds by activating the surface, deprotecting, and adding a tetramer. The chamber is undamped, the membrane is rotated 90°, 
and reclamped. A second round of tetramers are added. (C) Schematic bird's eye view of PNA oligomer array after completion 
of all 6 rounds of synthesis. Each tower represents 100 fmole of oligomers. Taller towers represent full size 24-mers which result 
from 6 rounds of synthesis in alternating directions. Each 24-mer tower represents a unique PNA sequence. Smaller towers 
represent half-size 12-mers which result from 3 rounds of synthesis in the same direction. All smaller towers in the same row as 
the arrows are of identical sequence. For clarity, the towers have been drawn as individual units, even though in the "real" 
synthesis they will be the same dimension as the grid squares and thus appear fused to each other. A "side view" of these 
individual towers is shown in Fig. 10. 

Fig. 10. Schematic cross-sectional view of synthesis of addressable array (legend). (A) Attachment of flexible spacer (linker) to 
surface of array. (B) Synthesis of the first rows of PNA tetramers. Only the first row, containing tetramer 1, is visible. The 
multi-chamber device is placed so that additional rows, each containing a different tetramer, are behind the first row. (Q 
Synthesis of the first columns of PNA tetramers. The multi-chamber device has been rotated 90°. Tetramers 9, 18, 7, and 12 
were added in adjacent chambers. (D) Second round synthesis of the PNA rows. The first row contains tetramer 2. (E) Second 
round of synthesis of PNA columns. Tetramers 34, 11, 14, and 23 were added in adjacent chambers during the second round. 
(F) (Not shown) Third round synthesis of PNA rows. The first row contains tetramer 3. (G) Structure of array after third round 
synthesis of columns, adding tetramers 16, 7, 20, 29. Note that all 24-mer PNA oligomers within a given row or column are 
unique, hence achieving the desired addressable array. Since each 24-mer differs from its neighbor by three tetramers, and 
tetramer differs from each other by at least 2 bases, then each 24-mer differs from the next by at least 6 bases. Each mismatch 
significantly lowers T m , and the presence of 6 mismatches in just 24 bases would make cross hybridization unlikely even at 35°C. 
Note that the smaller 12-mer sequences are identical with one another, but are not at all common with the 24-mer sequences. 
Even though the particular 12-mer sequence may be found within a 24-mer elsewhere on the grid, for example 17-1-2-3-28-5, a 
zip code will not hybridize to the 12-mer at temperatures above 50°C. 

Our design for a masking device is essentially the same as the masking technique developed by 
Maskos and Southern [33, 36]. This device will facilitate the desired array synthesis, and allow us to move 
on to testing zip code hybridization with Core B. The masking device will be designed and prepared by our 
industrial collaborators Dr. Ronald Cook of Siris Labs, and Dr. James Coull of Millipore. Simultaneously, 
we will also create, by hand, test arrays on membranes with aid of the Biorad dot blot apparatus containing 
individual microtiter wells. 



E. PROGRAM ASPECTS 

We are developing solid-phase methodology which will allow multiplex detection o: 
oligonucleotide ligation products that are indicative of cancer mutations. Specific aims of this project 
(Project 5) are: (i) Development and evaluation of solid support materials compatible with chemical 
synthesis of DNA oligonucleotides and PNA oligomers, and compatible with subsequent hybridization 
reactions, (ii) Establishment of methodology for synthesis of spatially addressable arrays of DNA 
oligonucleotides and PNA oligomers, (iii) Demonstration of scope and limitations of zip code concepts. 

The zip code approach, including the key needs for its experimental implementation, arose throug 
extensive discussions between F. Barany, R.P. .Hammer, and G. Barany. The studies described in Project 5 
that are directed towards solid support development and evaluation, with respect to either immobilization of 
pre-synthesized oligomers or their direct synthesis, will interface closely with efforts of Core B to prepare 
the needed DNA and PNA primer, zip code, and complementary zip code sequences, and to carry out 
hybridization assays. A collaboration with Project 3 will provide 5-propynyl-uridine monomers for 
incorporation into either DNA or PNA that may have improved thermodynamic parameters in hybridization. 
Progress in Project 5 on the preparation and application of spatially addressable arrays to detection of LDR 
products will have an immediate impact on the cancer work described by Project 1 and 2, since it will then 



PHS 398 (Rev. 9/91) _ _ Page 

~* ' Number pageTwbsecuD vel y al the "Bottom throughout' (he appUcatioiL"Do~/ior" use suffixes 



34 



Principal Investigator/Program Director (Last, first, middle): BARANY, Francis 



be possible to test clinical samples at a significantly enhanced throughput. We also anticipate considerable 
interactions between Project 5 and Core A for computer-aided design of zip code sequences. 

Addressable array capture will eventually be the preferred method of identifying mutations. In our 
initial examination of the p53 gene in colon, lung, and breast tumor samples we will only look for nine 
different mutations, V157, R175, HI 79, C242, G245, R248, R249, R273 and R282 (See Projects 1 and 2). 
By synthesizing LDR primers with tails of varying lengths we can easily distinguish between these 
mutations using gel or capillary electrophoresis. However, increasing the number of assayable mutations 
eventually makes electrophoretic detection less feasible. This is due to two reasons. First, mutation-specific 
LDR primers should differ in length by two bases for their products to be distinguished by electrophoresis. 
For a large number of mutations to be assayed together, very long primers would have to be synthesized. 
Second, only one mutant signal would be expected for most reactions. Since mutant signals will differ from 
each other by only two bases, minor defects in a gel lane could cause a misreading of the LDR product 
length and incorrect identification of the mutation. Both of these problems are currently overcome in our 
laboratory by using more than one color fluorescent label and internal standards within the same lane. 
However, once the primers become very long (75-100 bases), failure sequences (n-1, n-2) become 
increasingly harder to separate by HPLC or gel purification. In contrast, by synthesizing a unique 24 base 
zip code sequence to each LDR primer, the product can be captured by its complementary zip code at a 
discrete "address" on a two-dimensional array. Failure sequences do not present a problem for either the zip 
code sequence or its complementary address. A fluorecent signal at a specific address, as opposed to a 
specific size, thus indicates the presence of a specific cancer mutation. (See Core B.) 

A reusable addressable array with high capacity and excellent signal to noise specificity would be of 
benefit to several of our collaborators who need to detect large number of mutations. This will aid in the 
detection and identification of: hundreds of microorganisms by identifying 16s polymorphisms (Dr. Carl 
Batt), dozens of 6-lactamase mutations responsible for third generation 6-lactam resistance (Dr. Patrice 
Courvalin), epidemiological studies based on HIV polymorphisms (Dr. Olen Kew), dozens of 
polymorphisms in the E6 and E7 genes of high risk HPV strains (Dr. Saul Silverstein), multiple germline 
mutations in single gene disorders (Dr. Eric Hoffman, Dr. Perry White, and Dr. Emily Winn-Deen), and 
multiple somatic mutations in tumor suppressor genes and oncogenes (Dr. John Kovach, Dr. Michael 
Osborne, Dr. Basil Rigas, Dr. John Sninsky, Dr. Mark Sobel, Dr. Steven Sommer, and Dr. Thierry Soussi). 
Please see letters of collaboration in the overview section of this program project grant. 



F. TIMETABLE 

General: The various aims of this research will be pursued in parallel, with successful results in one arena 
providing impetus for progress on other aspects. The focus of Project 5 is the chemical synthesis of zip code 
DNA and PNA sequences, ideally in spatially addressable arrays, on appropriately optimized solid supports. 
As requisite materials and/or structures become available, they will be tested in relatively short order by 
Core 2. The list below follows a combination of descriptions in "Specific Aims" and in "Experimental 
Design and Methods." 

Task 1. Design and optimization of zip code/address duplexes. 

a. Synthesis of zip code sequences and their complements as DNA, followed by solution annealing studies. 
Months 1 to 12. 

b. Synthesis of zip code sequences and their complements as PNA, followed by solution annealing studies. 
Months 7 to 24. — - - - 

c. Synthesis of 5-propynyl-U monomers (Figures 3 and 4), synthetic incorporation into DNA and PNA, and 
solution annealing studies. Months 12 to 36. 

Task 2. Development and evaluation of solid support materials compatible with chemical synthesis of 
DNA oligonucleotides and PNA oligomers, and compatible with subsequent hybridization reactions. 
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a. Studies with commercially available membranes and literature methods for immobilization of end-group 
modified DNA and PNA. Months 1 to 30. 

b. Studies with "shaved" beads, which will be used for solid-phase synthesis of DNA. Months 1 to 18. 

c. Modifications in our laboratories of surfaces, beads, or membranes with hydrophilic spacers such as 
heterobifunctional polyethylene glycol (PEG) and/or carbohydrates (see Figure 6 and accompanying 
discussion), and further studies. Months 12 to 48. 

d. Development of novel chemistry for covalent immobilization of synthetic DNA or PNA (Figure 7). 
Months 6 to 30. 

e. Preparation and segment condensation of protected PNA tetramer building blocks (including 
optimization of protection scheme and coupling conditions), to build up 24-mer complementary zip code 
sequences which will be released into solution following chain assembly. Months 12 to 48. 

£ Segment condensation using protected PNA tetramer building blocks to build up 24-mer complementary 
zip code sequences which will be deprotected but retained on suitable solid supports for subsequent 
hybridization reactions. Months 36 to 60. 

Task 3. Establishment of methodology for synthesis of spatially addressable arrays of DNA 
oligonucleotides and PNA oligomers. 

a. Adaptation of commercially available membranes and literature methods for immobilization of end- 
group modified DNA and PNA, in tandem with spot methods and/or masking technology, to prepare and 
test relatively small arrays. Months 12 to 48. 

b. Application of additional advances from Task 2 towards generation of spatially addressable arrays (e.g., 
"gluing" of "shaved" beads to solid surfaces, direct masking/segment condensation on solid supports as 
outlined in Figures 9 and 10). Months 12 to 60. 

G. HUMAN SUBJECTS / VERTEBRATE ANIMALS: Not applicable 

L CONSULTANTS/COLLABORATORS: Project 5 represents a collaboration of Dr. George Barany 
(Principal Investigator), University of Minnesota, Dr. Robert Hammer, Louisiana State University, and Dr. 
Francis Barany, Cornell University, Medical College. In addition, we have excellent connections with 
leading industrial laboratories that are at the forefront of developing and commercializing methodologies for 
preparation of PEG-PS supports and functionalized membranes, PNA synthesis, and oligopolymer array 
construction. Specifically, we are collaborating with the team at Millipore led by Dr. James Coull, and a 
start-up company named Siris that is headed by Dr. Ronald Cook (see supporting letters). Recently, Dr. 
Michael Egholm, first author of several of the seminal papers on PNA, joined Millipore as a research 
chemist7 Dr. Derek Hudson, a long-time collaborator of Dr. George Barany (several joint publications), is 
currently at Siris. Letters and Biographical Sketches for collaborators are attached in the overview section 
of this program project grant. 

J. CONSORTIUM/CONTRACTUAL ARRANGEMENTS: Please see following page. 
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OUTSIDE OPINION OBTAINED 
Administrative Note: Page 19. 

Resume: Funds are requested for this application that describes a series of exciting 
technological advances for detecting high sensitivity mutations that could have 
a profound impact on the diagnosis, and eventually therapy, of human cancer. The 
application describes several clinical situations where high resolution detection 
of mutations in general, and base substitution mutations in particular, could have 
a major impact on the early detection and diagnosis of cancer, prognostication 
based on micrometastases at the time of diagnosis, and the early detection of 
recurrence. The strengths include the potential of the proposed studies, and the 
high capability of the investigators to carrying out the proposed studies. The 
weaknesses include the overly ambitious nature of the application, a lack of 
experience of the investigators in cancer related work, a lack of the proposed 
clinical correlations, a lack of biostatlstlcal consultation in the design of the 
studies, and the serious flaws involved in the proposed pilot studies of the clinical 
utility of the approach. If considered solely as an exercise in technology 
enhancement this proposal could have ranked 1n the outstanding range. This program 
project is rated at a very good to excellent level of merit. 
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OVERALL DESCRIPTION (Applicant's description) 

The long range objective of this proposal is to develop sensitive and specific 
approaches to the detection and stimulanteous identification of cancer-related, 
genetic alterations. Mutations and genetic aberrations have been implicated, at 
various steps, in the etiology and biology of tumors. Inherited mutations account 
for the predisposition to cancer in some families. Somatic mutations in tumor 
suppressor genes, oncogene amplification and viral DNA sequences have been found 
in cancers as well. However, the clinical use of these discoveries and research 
into their clinical significance has been slowed by the laborious process by which 
they are detected. To apply these discoveries and explore the interactions of 
multiple genetic alterations, we urgently need a new technology, which 1s capable 
of being automated and has the power to detect any of a vast number of mutations. 
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In response to the urgent need for new methods of mutation detection, we have 
assembled a team of investigators whose expertise will be directed toward innovative 
solutions to this problem. The collaborative nature of the scientific and 
organizational inTrastructure will facilitate the attainment of the projects 1 
specific aims and objectives. 

The specific aims of the five projects in this program project are to: (i) develop 
a multiplex polymerase chain reaction/1 igase detection reaction (PCR/LDR) system 
for the detection of inherited mutations in gennline DNA and somatic mutations 
in tumors; (ii) develop a ligase detection reaction/polymerase chain reaction 
(LDR/PCR) system for detecting gene amplifications and deletions in tumors; (iii) 
develop a PCR/restriction /LDR (PCR/RE/LDR) system for detecting and identifying 
mutations in rare cancer cells at a sensitivity of 1 in 106 or 1 in 107 by removing 
normal DNA sequences and selectively amplifying cancer mutations; (iv) design and 
synthesize nucleotide analogues for converting specific DNA sequences into 
restriction endonuclease recognition sites for PCR/RE/LDR mutation detection; (v) 
engineer a thermostable ligase with greater fidelity to enhance LDR and LCR 
specificity; (vi) design and synthesis oligonucleotide or peptide nucleic acid 
(PNA) addressable arrays for the simultaneous detection of multiplex LDR and LCR 
products; and (vii) explore the ability of these technologies to further our 
understanding and clinical management of lung, colon, breast and cervical cancers. 

OVERALL CRITIQUE 

The goal of this program project is to develop sensitive PCR/LCR and very sensitive 
PCR/RE/LCR base substitution mutation tests which could be used to screen, in 
multiplex, large numbers of tumors for important mutations related to cancer 
(PCR/LCR) and to screen large numbers of cells for the presence of these mutations 
(PCR/RE/LCR). The technology development effort in Project 1, together with Project 
2 could be outstanding, although the separation of the two projects appears 
artificial. 

The enthusiasm for Project 1, led by Dr. Wilson, is tempered by an inadequate 
discussion of potential false positive results. More important, is the weakness 
of the proposed clinical correlations. The experimental plan for these correlations 
is diffuse and poorly described. The screening of 90 tumors by PCR/LCR does not 
permit sufficient statistical power to allow for any correlations with clinical 
outcome and no acceptable plan is put forward to achieve this end. Given that 
the diagnosis of recurrence in lung and colon cancer requires examining known sites 
of metastases (e.g. liver, bone ) and the inability of current treatment to 
significantly impact on survival after documentation of recurrence, it is not clear 
how PCR/RE/LCR would permit early detection of metastases in a useful way. A focus 
on the prognostic significance of micrometastatic disease at the time of diagnosis 
or on the detection of known mutations in cells not yet cytologically "malignant* 
would have far greater utility. It is not clear what the second clinical site 
in Denver adds to the overall project. The feasibility of the proposed technologies 
could be tested adequately on the samples already located in New York. Overall, 
this project is rated as very good. 

Project 2, headed by Dr. Francis Barany, is considered to be the stronger of the 
two clinical projects and stronger enthusiasm is expressed for the application 
of the proposed technology to the problems inherent in working with clinically 
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heterogenous tissue such as that found in human breast cancer. As in Project 1 ( 
there is considerably less enthusiasm for the cl inical-correlati ve studies. The 
investigators did.not present a convincing case that they understood the extent 
of the clinical base of materials available and whether they knew how to apply 
their assays even in a preliminary way. The project would have benefitted from 
the up- front collaboration of a biostatistician. The studies proposed for cervical 
cancer, although interesting, are not considered to add anything to the project 
and could be dropped. Overall, the project is rated as excellent. 

Project 3 is directed by Dr. Donald Bergstrom. The proposed design and synthesis 
of candidate convertides and universal bases for development of new and useful 
oligonucleotide diagnostics represent a largely empirical, albeit scientifically 
sound and potentially highly significant, effort. The structures and synthetic 
routes appear reasonable. Promising results have already been obtained and it 
appears that the proposed iterative ■synthesis-testing-synthesis* scheme is feasible. 
In view of the challenging synthetic goals and the lack of a convincing rationale 
for the- proponyl ol igonucleotides, it is recommended that this latter element be 
reconsidered. The potential for this project to have a significant impact on nucleic 
acid based technology leads to an overall rating of outstanding. 

Each of the three specific aims proposed in Project 4 f led by Dr. Francis Barany, 
has the potential to generate important information about the Tth DNA ligase 1n. 
particular, and DNA ligases in general . With respect to the applicability to the 
rest of the program project, the proposed determination of the structure of the ' 

_1igase is co nsid ered to be the -wea kest part of the project. The probability that 
the structure will actually be solved early enough in the project to be applicable 
to the other projects is considered remote, although the work Itself Is of major 

'"importance to the field. The ability of an improved ligase to enhance the 
sensitivity of the LCR/LQR assays, and hence their potential diagnostic value leads 
to an overall rating of excellent for this project. 

In order to carry out the large-scale screening of mutations, the Project Leader, 
Dr. George Barany (Project 5), proposes to develop spatially addressable arrays 
of oligonucleotides or peptide oligonucleotide analogs. Ligase reaction products 
will be constructed with fluorescent groups and will bear specific "zip code" tails. 
The tails will be selectively captured by complementary zip code probes immobilized 
in the array. Each zip code will map for a specific, known, genetic mutation. 
Serious concern is expressed over the abil ity of this workplan to actually deliver 
a working array to the other projects. The project lacks adequate planning in 
hybridization, signal to noise ratios, array fabrication expertise and 
instrumentation development. Dn the other hand, the concept of zip code sequences 
and the potentially elegant contributions to solid-phase chemistry by the project 
leader are considered strong assets. Also recognized, 1s the broad applicability 
of the .technology to. areas . out si.de_of„cancer diagnostics. Overall , the project 
is rated as very good to good. 

Core A, headed by Dr. Niel Hackett, proposes to: (1) create and maintain a relational 
database from existing clinical databases in New York and Denver; (2) perform 
statistical correl at i ons of cl-mi ea-1- out come s-w-i-t h 1 aboratory stud ies-H"3)~devel op 
software for the selection of appropriate primers and zip codes; and (4) continue 
support of both instrumentation and connectivity. All four elements of this facility 
are considered essential; however, the experience of the Core Leader is considered 
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sufficient only for elements 3 and 4. There is no demonstrable expertise in the 
acquisition or analysis of clinical data and this is a major weakness of the overall 
application. Thus a merit rating of this Core is at acceptable level of merit. 

Core B (Dr. Francis Barany) is recognized as an essential part of this program 
project. The reviewers consider that the primary function of this core should 
be to provide oligonucleotide reagents to the rest of the program, and to a lesser 
extent, provide a testing service.. for polymerase fidelity and efficiency of 
nucleotide conversions. This Core would benefit from additional expertise in the 
areas of fluorescence instrumentation and the implementation of robotics. Overall , 
this core is rated as excellent. 

Core C, directed by Dr. Francis Barany addresses the need to coordinate the 
interaction of the applicant organization with six separate consortium institutions. 
The projects are proposed to operate largely independently, with this core focusing 
on comnunication and reporting requirements. Dr. Barany has demonstrated an ability 
to develop substantive scientific collaborations, but evidence of administrative 
systems to promote and monitor these interactions 1s lacking. Dr. Buck, the 
administrative co-investigator, has been recruited on a part-time basis from the 
Strang Clinic to address these systems but his contribution to-date 1s not evident. 
An impressive panel of external advisors has been recruited, but the means to 
internalize their advice and maximize the efficacy of an annual review have not - 
been addressed. A vast number of letters of support are provided that propose 
specific studies unrelated to this grant while providing strong support for the • 
overall concept. The overall rating for this core is good to acceptable. 

Program as an Integrated Effort: Although this program is well integrated and 
has synergy as far as its goals are concerned, the diverse locations of the 
laboratories along with a lack of a plan to hold the frequent meetings between 
the project leaders, on the part of Dr. Francis Barany, hinder the interactions 
between the investigators. However, formulation of a plan by the Principal 
Investigator to hold bimonthly meetings could facilitate such efficient interactions. 

PRINCIPAL INVESTIGATOR 

Dr. Barany received his Ph.D. degree in Microbiology from The Rockefeller University 
in 19B1 where he also spent a year for a postdoctoral training. He then moved 
to Dr. Hamilton Smith's laboratory at The Johns Hopkins University for a three 
year postdoctoral training in Molecular Biology. He returned to New York in 1985 
to assume a faculty position at the Cornell University Medical School where he 
is currently an Associate Professor of Microbiology. He is also an Adjunct Associate 
Professor at The Rockefeller University. His scientific productivity 1s reflected 
by his numerous publications in highly competitive refereed journals. However, 
Dr. Barany clearly expresses a lack of experience in dealing with the problems 
posed by human cancer and describes a series of existing and potential collaborations 
to overcome this issue. He does not demonstrate previous experience in managing 
a project of this magnitude but his ability to pull together and provide leadership 
to this program has led the reviewers to believe that he is well qualified to serve 
as the Principal Investigator. 
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INDIVIDUAL PROJECTS AND CORES 

Project 1: Genetic Markers of Lung and Colon Cancer 
(Vincent Wilson, Ph.D.) 

Description : (Applicant's description) In the last decade mutations in many 
oncogenes and tumor suppressor genes have been described in cancers. This knowledge, 
however, has not significantly changed the care of cancer patients. Do cancer 
mutations predict the behavior of tumors? To correlate mutations with clinical 
outcomes we need robust methods toj.dentify many possible mutations. Can-the early 
spread of cancer be determined by finding the mutations of cancer cell in the bone 
marrow? To detect micrometastases or early cancers we must be able to detect a 
few cancer cells out of many normal cells. 

To achieve these capabilities we have devised two technologies: polymerase chain 
reaction/1 igase chain reaction (PCR/LCR) to survey tumors for a wide number of 
mutations simultaneously; and polymerase chain reaction/restriction endonuclease 
digestion/1 igase chain reaction (PCR/RE/LCR) for detecting a few cancer cells out 
of many normal cells. When they are fully developed PCR/LCR should be able to 
detect tens to hundreds of mutations at a sensitivity of one in 102 or 103, 
PCR/RE/LCR has already detected one mutation-bearing cell out of 107 normal cells! 

To demonstrate the feasibility of these methods our specific aims are to: (i) Develop 
a PCR/LCR multi-gene, multi-mutation detection system to simultaneously identify 
mutations in three condons of the k-ras oncogene and nine Condons of the p53 tumor 
suppressor gene. Approximately half of colon cancers have these k-ras mutations. 
About 15 percent of lung tumors and about 21 percent of colon cancers have one 
of these nine p53 mutations. Using PCR/LCR to identify these mutations we will 
investigate 40 colon and 50 lung tumors; (ii) Refine PCR/RE-/LCR to detect the above 
p53 mutations at sensitivities of one in 107. We will first use PCR/RE/LCR to 
determine the natural background mutation rate in non-cancerous tissues. Then, 
for patients whose tumors had detectable p53 mutations, we will use PCR/RE/LCR 
to investigate lymph nodes, blood and bone marrow specimens for micrometastases. 

CHtifluf: The project leader proposes to amplify genomic fragments from cancer- 
related genes using PCR and to rely on the allele specificity of the ligase chain 
reaction (hence PCR/LCR) to detect specific point mutations. Additional selection 
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is proposed to be added during the PCR ampl if ication byway of restriction 
endonuclease cleavage of the wild type sequences (PCR/RE/LCR). 

The oroject leader makes a compelling case that LCR (or LDR in Project 2) is a 
iood method for detecting mutations in clonal tumor populations or in tissue samples 
fn whfch the tumor cells represent 0.1-1 percent of the cells. Ease of multiplexing 
was ; put forward as the major advantage of LCR over allele-specif ic PCR. 
Nevertheless, unlike LDR in project 2. no preliminary data « presented for PCR/LCR 
with N>2 For the work scheduled for years 1-3, the project leader has selected 
mutations in 3 codons of k-ras (20 mutations) which will detect mutations in about 
Z £r?Int of colon and an undisclosed number of lung cancers, and Nations in 
9 codons of P 53 found in 15 percent of lung and 21 percent of colon cancers. PCR/LCR 
would be applied to 40 colon and 50 lung tumors from a more than adequate supply 
of available specimens. It is not clear how these specimens would be selected. 
No power calculations are given to assess the effort needed to extract clinically 
relevant data (e.g. prognosis) from this type of survey. It is also disappoint ng 
JhlrSo calculation is given as to the effort necessary to develop a multiplex 
PCR/LCR system capable of detecting mutations in a large fraction (e.g. 95 percent) 
of either lung or colon cancers, a prerequisite to screening or other applications 
of PCR/RE/LCR. Also, the discussion of the use of readily available fixed specimens 
is inadequate. 

The project leader has also made a compelling case that PCR/RE/LCR could be developed 
in vears 2-5 into a highly sensitive method for detecting mutations in tissue samples • 
"SFere tumor cell s" represented appVorimately I/tt 6 cells.-The compet^g-technology 
of PCR/RE/allele-specific PCR is not compared, although the previous advantage 
of potentially easy multiplexing in PCR/LCR would be lost in PCR/RE/LCR No 
exolanation was proffered as to why dilution of mutant plasmid into wild type plasmid 
led to decreased signal intensity (p. 218 of the application) whereas dilution 
of 1-10 cells containing mutant DNA into wild type cells led to constant signal 
intensity (p. 220). Furthermore, the project leader stated that experiments with 
1 cell are routinely positive. Given the required sensitivity and discrimination 
of these experiments, data showing detection as a function of PCR cycle number 
for the various ampl if icat Ions would have been reassuring. Although he is an expert 
at applying PCR technology, the discussion of contamination is inadequate for the 
task at hand. PCR/RE/LCR 1s equivalent to sperm typing, where heroic efforts have 
been needed to eliminate false positives from amplification product carry over. 

A discussion of potential false positives due to DNA polymerase errors is included, 
but the assumed DNA polymerase error rate is at least an order of magnitude lower 
than the cited literature. Ligase may be contributing of the order of 100-fold 
to the discrimination. Nevertheless, the results (Fig. 4 a 6) for PCR/RE/LCR at 
Hspl sites, which contain only dG and dC and which are copied by Taq polymerase 
"with greatest f idel ity, might suggest that the same degree of PCR/RE discrimination 
could not be achieved for the majority of mutations. 

The proposal to examine various clinical samples for micrometastases, except as 
applied to establishing initial staging, appears to be misdirected to tissues 
unrelated to metastatic spreao. 

Intrinsic Scientific Merit Score: 221 
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Personnel : 

Name: Vincent Wilson 

Degree/Discipline'/Date: Ph.D., pharmacology & toxicology, 1980 
Role/Percent Effort: Project Leader, 20 percent 

Qualifications/Experience: Dr. Wilson was a Senior Staff Fellow in the Laboratory 
of Human Carcinogenesis, NCI (1982-1988) where he published extensively with Dr. 
Curtis Harris on DNA adducts. His recent and independent publication record has 
not been particularly impressive. The work on which this study is based derives 
from his SPORE pilot project and his collaboration with the applicant. They have 
developed a highly sensitive method for detecting point mutations in one cell in 
millions, which if generalized could be important for cancer biology. Dr. Barany 
credits Dr. Wilson with the original concept of the application. 

Assessment in designated role: highly qualified 

Name: Leonid L. Reznikov 

Degree/Discipline/Date: M.D., andrology and uroloncology, 1986; Ph.D., laser 
medicine, 1990 

Role/Percent Effort: Postdoctoral fellow, 100 percent 

Qualifications/Experience: His recent publications, In Russian, have all been in 
urology. He has no background in molecular biology. 

Assessment in designated role: qualified. 

- Budget : — The -modest-budget- is approved as. requested. ~- - 

Assessment : Level of merit; very good 

Project 2: Genetic Markers of Breast and Cervical Cancer 
(Francis Barany, Ph.D.) 

Description : (Applicant's description) To improve cancer care, researchers and 
clinicians need robust methods of identifying genetic alterations in cancers. 
There are three important challenges that need to be met: (1) the detection of 
many possible point mutations in tumors; (ii) the quantification of gene 
amplifications and deletions in tumors; and (iii) the detection of rare cancer 
cells against a background of normal cells. Researchers need these capabilities 
to be able to correlate multiple genetic alterations with clinical outcomes, identify 
new cancer-related genetic loci, and detect early cancer recurrence and premalignant 
cell. 

To accomplish this, we will develop: (i) a multiplex polymerase chain reaction/1 igase 
detection reaction (PCR/LDR) system to detect many possible point mutations in 
cancers; (ii) a<j multiplex ligase detection react ion/polymerase chain reaction 
(LDR/LCR) system to quantify gene ampl if i cat ions and deletions in tumors; and (iii) 
a polymerase chain reaction/restriction endonuclease/1 igase detection reaction 
(PCR/RE/LDR) to identify 1 cancer cell in 106 normal cells. 

Some issues in cancer will be explored. Specifically we will : (1 ) Expand PCR/LDR 
to detect 24-40 point mutations, (63% to 79% of p53 gene mutations) in breast tumors. 
PCR/LDR will also be used to detect high risk human papillomavirus (HPV) in cervical 
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lavaoes or biopsies. (11) Use "zip code' primers to proportionally PCR amplify 
and Quantify LDR products of genes deleted or amplified in tumors This should 
alloS us to simultaneously detict HER-2/neu and int-2 gene amplifications as well 
as D53 gene deletions in breast tumors, (ill) Refine PCR/RE/LDR for detecting 
f vr D 53 mutations by selectively amplifying mutated DNA while removing wild-type 
oroducts by Taql restriction endonuclease cleavage. We will then characterize 
III gene nwtitions and deletions. HER-2/neu amplifications and int-2 amplificat ons 
in 100-200 frozen breast tumors and corresponding fixed specimens We will be 
use PCR/RE/LDR to look for micrometastases in the bone marrow and lymph nodes of 
clients whose tumors had one of the five specific p53 mutations. Ultimate y, 
this sensitivity method may identify early relapses or primary tumors by detecting 
circulating cancer cells in the blood. 

Critioue: In Specific Aim 1. a multiplex PCR/LDR system will be developed to detect 
Satins in the p53 gene and to detect high risk HPV strains in clinical samples. 
tSp Vt^nath of the project is that rapid methods to detect single base pair 
I aUonsVre much needed'in the clinical arena, and the PCR/LDR and the PCR/RE/LDR 
methods may prove useful for this purpose. From a technical point of view the 
boundaries between this project and Project 1 are artificial; this project is 
S?s?inSuished from Project 1 primarily by the fact that Project 1 concentrates 
on LCR methods while this project concentrates on LDR methods . The three specific 
aims entail PCR/LDR of tumor biopsies and Human Papilloma Virus (HPV), LDR/PCR 
for measuring gene amplification, and PCR/RE/LDR for detecting mutations at a 
sensitivity of 1 in .10 6_or 10 7. In.contrast to project .1, the target genes are 
directed towards genes involved in breast cancer (and HPV infection associated 
with cervical cancer), with an overlap with project 1 in regard to detecting some 
p53 mutations. The project leader hopes to identify new correlations with prognosis 
and the mutations revealed. 

The oroject leader has chosen to initially examine five different mutation sites 
within the p53 gene (constituting about 28X of reported p53 mutations) because 
these are known sites for TaqI conversion using PCR/RE/LDR. This specific aim 
proposes to develop the method for large screen testing using a pilot panel of 
100 to 200 breast cancer cases from frozen or fixed specimens. However, no 
preliminary data was shown using a sample with a known mutation at one of these 
sites in either frozen or fixed breast cancer samples. Furthermore, It is unclear 
from the letters of collaboration, the nature of the corresponding follow-up clinical 
information on these patients. The letter of collaboration from Dr. Koyach states 
that 75 known p53 mutation sites-containing samples would be made available, but 
it is not clear which of these contain the five p53 mutations chosen for study. 
The letter from Dr. Summers does not state any sample information, but appears 
to be referring to the wrong grant application. However, it is apparent that the 
well -characterized tumor bank available thr.ough Dr. Osborne is a. valuable resource 
which will be made available to the investigators. 

This project refers to project 4 because improvements in the buffer conditions 
or ligase may be needed in order to increase the sensitivity of PCR/LDR to 1 cancer 
gene-mutat-ion-in-10 2 to-10 3 normal cells. If one accepts that the need for further 
optimization of pH , salt concentration, incubation time and temperature that give 
maximum fidelity for Tth ligase in LDR assay, and thus LCR (see page 356 of project 
4), then the experiments proposed seem premature. It is most important to optimize 
the signal before performing complex experiments. 
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Another issue is That the investigators have chosen the p53 gene because of the 
frequency with which mutations have been detected in tumors, and it is a valuable 
test system for developing the methods of PCR/LDR and PCR/RE/LDR. But it is 
questionable whether the investigators will be able to correlate their findings 
due to their current lack of integration with the clinical database and their lack 
of statistical power. Even very good prognostic markers with hazard ratios of 
1.3 to 1.4 and rare mutations rates (for instance of 8 percent) would require 1000 
to 1700 specimens to see differences 1n disease free survivals (in node-negative 
disease) after five years. Additionally, there is no discussion of statistical 
analysis in the project; a statistical collaborator would strengthen this project- 
It is of concern that cellular heterogeneity inherent 1n breast cancer has not 
been adequately addressed; this may be problematic in PCR/LDR assays. It would 
probably be best to develop Specific Aim 3 first, and increase sensitivity, before 
screening large numbers of samples outlined in Aim 1. 

The project leader has also included studies to detect high risk HPV strains in 
cervical carcinomas. It is felt that the inclusion of this study, when the rest 
of the project is devoted to the detection of breast cancer-specific changes, makes 
the project diffuse. It is suggested that this study be omitted. 

The arguments for gene quantification as a method to look for ploidy changes are 
quite convincing. However, figure 7 in the application is a multiplex experiment 
in which the ratios should be~l : 1 but they are not; this is not discussed by the 
Project leader. It is stated that 'the prognostic significance of gene 
amplifications (e.g. HER-2, c.-myc, and int-2) in breast cancer has not been clearly 
established..)". There are numerous studies demonstrating the clinical utility 
of HER-2 in node-positive disease, and the lack of utility of int-2 as a prognostic 
marker. It is not understood why int-2 has been chosen for study due to its low 
level amplification rate in breast cancer (less than 15 percent) and due to the 
fact that the protein product of int-2 is not expressed in breast cancer. Although 
the development of an assay for gene amplification is worthy of study, the 
development of it for genes such as the int-2 gene is both scientifically 
uninteresting, and a poor choice for study. However, the HER-2 gene is an 
appropriate choice for methodology development if HER-2 amplification status has 
been previously determined by conventional methodologies in these clinical samples. 

In Specific Aim 3, PCR/RE/LDR will be developed to detect rare mutations with 
increased sensitivity for the eventual application of occult micrometastasis 
identification; this aim is undoubtedly the strength of the project. The PCR/RE/LDR 
strategy could prove to have a relatively high sensitivity in clinical samples. 
As the investigators point out, the level of error incorporation of Taq polymerase 
- is unlikely to average over 1 in 10 6 cells, and furthermore, the ligase has a 
50x to 500x selectivity against mismatches involving the 3' nucleotide of the 
ligation substrate. Preliminary data would have dispelled these doubts. 

To general ize PCR/RE/LDR, the project Leader suggests conversion of mutation sites 
to include a restriction site by the use of primers with "convertlde' nucleotides 
or nucleotide analogs. The example given is for transitions from CCGG to TCGA, 
the easiest of such conversions. The investigators propose that other harder 
conversions could use nucleotide analogs that permit the introduction of other 
bases in replication (project 3). 
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The investigators in Project 3 may have already found a universal "convertide" 
for this purpose. The other proposed convertides would not now be deemed necessary 
for this project. They would still be interesting as variants at the penultimate 
3 1 base for increased discrimination. It should be noted that there is no reference 
to previous work regarding 'converting" sequences to restriction sites using PCR. 
These include Hruban et al.. Am. J. Path., 1993 and Mitsudomi et al . , 1991. These 
alternative technologies should have been discussed. 

From the point of view of error rate in polymerization or ligation, the examples 
shown Generally represent atypically easy targets for demonstrating this strategy; 
the restriction site is 5'-CCGG (or S'-TCGA with the T and A defined by the primers) 
and it is known that Taq polymerase has its lowest error rate for G:C base pairs. 
The mutations detected are usually transversions, for which the ngase has the 
maximum possible discrimination. This means that the example shown will not reflect 
the typical combination of A:T-containing restriction sites and transition mutations 
that will be encountered in a big survey. The problem of fidelity at some sites 
was acknowledged at the visit. 

Once conversion of a site is contemplated there can be no multiplexing. One should 
also consider whether allele-specif ic ASO (similarly spiked to give quantitation) 
would be equally efficacious, or even more so, after iterated PCR/RE. Such a PCR 
step could take place under maximum fidelity conditions (such as single stranded 
binding protein and low nucleotide concentrations) which seems to match the fidelity 
of LCR. Specific reasons why this is not a good idea should be given. 

The investigator recognizes that in many of the cases for which this methodology 
may be useful, the issue will not only be that of whether a mutation is present, 
but also how prevalent this mutation might be. To achieve this the reaction will 
be spiked with a primer containing a different 3* base than the one to be assayed 
and with a different product length. This strategy assumes that the chosen spike 
sequence will never occur- in * tumor -but i s. nevertheless likely to work in many 
cases. 

Regardless of these concerns, a high degree of enthusiasm is expressed for this 
study, independent of the ability of the project leader to correlate the information 
gained with clinical parameters. 

Intrinsic Scientific Merit Score: 183 



Personnel : 

Name: Francis Barany 

Degree/Discipline/Date: Ph.D.. microbiology, 1981 
Role/Percent Effort: Principal Investigator, 15 percent 
Qualifications/Experience: described earlier. 

Assessment in designated role: highly qualified. 

Name:" Matthew B. Lubin 

Degree/Discipline/Date: M.D., medicine, 1984 
Role/Percent Effort: Co-Investigator, five percent 
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Oualifications/Experience: Internship and Residency, and Fellowship in Internal 
Medicine, andMedirtl Genetics, respectively (1987-1990); Clinical Instructor (1990- 
1993); Director of Medical Genetics, and Assistant Professor (1990 and 1993). 

Assessment in designated role: well qualified. 

Name: Darren Day 

Degree/Discipline/Date: Ph.D., biochemistry, 1989 
Role/Percent Effort: Research Associate, 100 percent 

Qualifications/Experience: Postdoctoral Fellow at the University of Southampton, 
United Kingdom, and Auckland University, New Zealand (1990-1992). 

Assessment in designated role: qualified. 

Budget : The requested budget is approved. 

Assessment : Level of merit; excellent. 

Project 3: Design and Synthesis of Nucleotide Analogues 
(Donald Bergstrom, Ph.D.) 

Description : (Applicant's description) A high sensitivity mutation detection- 
system must be able to detect changes in any gene sequence. To be able to accomplish 
this, wild type DNA sequence corresponding to a mutation needs to be converted 
to a restriction enzyme site sc repeated PCR amplification followed by digestion 
with the restriction enzyme removes the normal sequence while selectively amplifying 
the mutant sequence. This amplification, known as PCR/RE/LDR aims to detect one 
cancer mutation in 106 normal cells. 

The goal of this project is to design and synthesize nucleotide analogues which 
facil itate sequence conversion. "Convertides* are nucleoside analogues which pair 
to one or more of the natural bases in an initial primer hybridization. More 
importantly, convertides also function as a degenerate template allowing for 
insertion of different base during subsequent rounds of polymerase amplification. 
There are twelve possible nucleotide conversions which should be achieved. 

To accomplish our goal we will work towards the following specific aims: (i) The 
synthesis of deoxyribonucleoside analogues to be used as convertides. Eight 
deoxyribonucleoside analogues, 02. 05, 06, and Q9-Q13, have been previously 
described. We have already designed nine additional modified deoxyribonucleosides, 
01, 03. Q*. Q7, 08, and 014-017 . All 17 deoxyribonucleosides analogues will be 
synthesized in our laboratories, (ii) Preparation of dimethoxytrityl (DMT)-protect 
derivatives of all the convertides for incorporation into oligonucleotides. In 
the middle of an oligonurleOtiderWT-convertide phosporamidites will be used. 
At the 3' position this will be accomplished by attaching the 3'-hydroxyl of the 
protected convertide to a long chain alkyl amine-CPG support, (iii) Testing of 
convertides for use in the mutation detection techniques. Starting with Q2, Core 
B will test convertide oligonucleotides as a means of increasing the specificity 
of mutation detection and as universal bases for polymorphic sites. (1v) 
Synthesizing and incorporating 5-propynyluridine into DNA or PNA "zip codes - . 
In addressable arrays, this will be tested for optimizing the Tm of the zip 
codes/complementary zip code duplexes (Project 5 and Core B). 
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Critiaue- The first Specific Aim of this project is to synthesize 17 deoxynudeoside 
analogue 's (Q1-Q17) to be incorporated into convertide oligonucleotide probes and 
□rimers- 8 analogues have been previously reported and 9 are apparently new. The 
underlying theory-for the selection or design of each of these particular analogue 
structures seems reasonable. The H-bonding bonding schemes and tautomeric forms 
shown on p. 321 of the application are also reasonable, with the exception of Q4 
and 08 in which oxygen has been replaced by sulfur. It would have been useful 
in each of these two less familiar cases to include supportive thermodynamic 
calculations. The same is true for- several - E-nucTeosides, the duplex-forming 
oroDerties of which are not clear-cut. On the other hand, the applicant s rationale 
regarding the selection or design of Q1-Q17 is supported by his prel ™inary studies 
of 02 that include thermal melting (Tin) measurements to determine if QZ is non- 
discriminatory in base-pairing to A, C, G, and T, as well as investigation of 
seouencing and PCR reactions. While the Tm resulted per se are can only be 
sugqestivi of non-discriminatory base-pairing of QZ, it is very important that 
an oligonucleotide with 02 (but not with mismatches) was already shown by the 
aoDlicant to be functional in primer extension., in PCR and by T7 DMA polymerase 
even with Q2 located at the 3 '-end of a primer, which is a critical locus for the 
proposed convertide mechanism. 

The applicant's preliminary synthetic results for Q14 are likewise supportive. 
Known-target analogues Q12 and Q13, the synthesis of which are said to be in progress 
along with Q5 and Q9, have already been shown by others to be tolerated by Taq 
polymerase when at the 3' end of a primer. This gives this study further credence . 
with regard to feasibility. 

In summary, the structural targets proposed in the first Specific Aim represent 
novel and challenging design and synthesis efforts that are correctly viewed by 
the applicant as being an empirical screening process to find which candidate 
convertides'will actually provide acceptable biochemical "read" and "write" kinetics. 
While this adds some uncertainty to the degree of success achievable with each 
of the 17 convertides presently proposed, there seems to be no other way to approach 
the problem at this stage, since even state-of-the-art molecular modeling can at 
best be used to assess relatively simple H-bonding schemes and duplex stability 
but none of the critical polymerase reactions. 

Regarding synthesis, half of the targets are known and should therefore pose no 
significant difficulties. The new convertides involve largely conventional 
transformations with apparently reasonable literature precedent. These syntheses 
will involve a substantial amount of work but are otherwise unremarkable. 
Alternative synthetic schemes are said to be available, although not written in 
the application. 

The second Specific Aim, which is the preparation of DMT-protected support-bound 
and phosphoramidite derivatives of the convertides, has been adequately addressed 
using relatively straightforward chemistry that should pose no significant problems 
for the applicant's team. Purification and analysis of these oligonucleotides 
is described only very briefly, however, again there is ample precedent for this 
t-ype -of- post-synthetic -work - - — 

The third Specific Aim involves testing of convertides as a means of increasing 
the specificity of mutation detection and serving as functionally useful bases 
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for polymorphic sites, which is work carried out in program projects 1, 2, and 
4. Promising convertides will be further studied by the applicant using conventional 
measurement of thermodynamic parameters. 

The fourth Specific Aim concerns synthesis of 5-propynyl-dU by the Method of Froehler 
in order to obtain modified oligonucleotide with supposedly higher hybridization 
affinity and thus f in principle, obtain greater discrimination in the addressable 
arrays described in Project 5, A similar proposal was made for 5-propynyl uracil 
PNA monomers with either Fmoc or Boc protection. This is an interesting idea in 
the case of dU based on Froehler's published initial work; however, the generality 
and magnitude of this effect in mixed-based sequences has not apparently been 
reported. There is apparently no experimental precedent that the 5-propynyl effect 
will apply to PNA-DNA heteroduplexes, nor is any supporting theoretical rationale 
developed by the applicant. Moreover, no already proven options such as 2 , -0'alkyl 
oligonucleotides are even discussed. While this aspect of the work is far less 
critical to realization of the convertide concept. It nevertheless ought to be 
reconsidered. - 

In surmiary, this is an exciting project based upon the early work of others and 
preliminary promising results obtained by the applicant. It can contribute novel 
compounds for potentially promising and generally useful enhancements to 
hybridization-based detection and a justification schemes proposed in the program 
and possibly beyond. The synthesis of convertide-containing oligonucleotides by 
the present project is clearly critical for the success of the overall program. 
A specific task time-table is not developed indicating who is to make which 
convertides by specified projected dates for delivery to the other projects and 
core components; however, the synthetic work is distributed between Drs. Bergstrom 
and Hammer according to the heterocyclic ring size, and substantial progress is 
said to have been made. In view of the challenging scope of synthesis, and the 
importance of this project to the program, it will be important for the project 
leader to closely monitor and direct as needed, the work that is proposed to be 
conducted in Dr. Hammer's laboratory. This project can conceivably have far reaching 
and substantial impact on nucleic acid-based technologies beyond the scope of the 
program. 

Intrinsic Scientific Merit Score: 139 

Personnel : 

Name: Donald E. Bergstrom 

Degree/Discipline/Date: Ph.D.. organic chemistry, 1970 
Role/Percent Effort: Project Leader, ten percent 

Qualifications/Experience: Professor of Medicine and Chemistry at Purdue University 
since ! 989 and Deputy Director of the Purdue Cancer Center since 1992; he has served 
on the editorial board of Nucleosides and Nucleotides since 1992 and has a productive 
publication record in synthetic aspects of that field. 

Assessment in designated role: highly qualified. 

Name: Robert P. Hammer 

Degree/Discipline/Date: Ph.D., organic chemistry, 1990 
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Role/Percent Effort: Co-investigator, ten percent during nine month academic 
vears/66.7 percent during three month summer 

Oua' 1 If ications/Experiencel Postdoctoral 1990-1992 at ETH with world-renown Professor 
Albert Eschenmoser on project-related chemistry, and Assistant Professor of Chemistry 
at LSU since August 1992; he has only one relevant publication in J. Org. Chem. 
in 1987 and several proceedings abstracts. 

Assessment in designated role: qualified 

Name: Guanygl Wang . 1QO , 

Degree/Discipline/Date: Ph.D. , organic chemistry, 1987 
Role/Percent Effort: Postdoctoral Fellow, 100 percent 

Qualifications/Experience: Postdoctoral positions at the University of Arizona, 
UniversUy of Maryland, and Purdue University, the latter since 1989 with Dr 
Bergstrom; he is listed as a co-author on about a dozen organic chemistry journal 
publications since 1987. 

Assessment in designated role: qualified. 

Name: Peiming Zhang , 
Degree/Discipline/Date: Ph.D., organic chemistry, 1980 

Role/Percent Effort: Research Scientist, 50 percent. . 
Qualifications/Experience: Postdoctoral since 1990 with Dr. Bergstrom; he is co- 
author of only one publication, which is with Dr. Bergstrom on the synthesis of 
a relevant nucleotide analogue. 

Assessment in designated role: qualified. 

Name: Melissa Cothern 

Degree/Discipline/Date: B.S., chemistry, 1993 

Role/Percent Effort: Research. Assistant. 100 percent at no cost 

Qualifications/Experience: No publications 

Assessment in designated role: no proven experience. 

Budoet : The requested budget is modest and is approved. 

Assessment : Level of merit; outstanding. 

Project 4: Engineering an Improved Thermostable Ligase 
(Francis Barany, Ph.D.) 

Description : (Applicant's description) One of the fundamental problems in detecting 
cancers in tissue samples is the need to distinguish a few cells containing the 
cancer mutation from the vast majority of normal cells. We have developed a novel 
polymerase chain reaction/1 igase detection reaction method (PCR/LDR) for nigh 
throughput, low sensitivity mutation detection (1 in 10 2 to 10 3), and a 
PCR/restriction endonuclease/LCDR (PCR/RE/LDR) method for high sensitivity mutation 
-detection (1 in 10 6 to 1 in 10 7, see Project 1 and 2). The enzyme which provides 
the specificity for these methods is Tth ligase, the gene for which was originally 
cloned in our laboratory. The limit of detection of these two methods would be 
significantly improved by increasing the specificity of Tth ligase. 
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We are developing a comprehensive approach to understanding the mechanism of Tth 
ligase action, and improving its fidelity for discriminating perfectly matched 
from mismatched substrates. The three parts to this program are: (i) Developing 
a rapid assay to test different reaction conditions, mutant Tth ligases, and 
demonstrates higher sensitivity then the discriminating base is on the 3 1 end of 
the test primer. Introducing a nucleotide analogue (see Project 3) or mismatched 
base adjacent to or near the discriminating base may increase the specificity of 
this reaction. Such modified oligonucleotide primers will be tested in our fidelity 
assay using wild type and mutant Tth ligase. (ii) Determining the 3-dimensional 
structure of Tth ligase-DNA complex. This structure will help to reveal the 
mechanism of DNA ligation, and provide an understanding of the specificity of the 
enzyme for mismatches at the nicked site. < i 1 1 ) Using site-specific mutagenesis 
to construct mutant Tth ligases. Design of these mutants will be based on protein 
sequence homology and protein-sugar-phosphate backbone contacts as determined from 
the X-ray structure. We have already isolated and partially characterized over 
30 site-specific Tth ligase mutants, and these will be tested in our fidelity assay. 

Critique : The overall goal of this project is to better understand the functional 
domains of Tth ligase and its mechanism of action which will hopefully result in 
an enzyme with improved specificity. Although DNA ligases from both prokaryotes 
and eukaryotes have been studied for many years, there has not been a comprehensive . 
attempt using modern methods of site directed mutagenesis to study their reactions. 

The primary reason for undertaking this program is the LCR and a variation LDR 
which hold great promise as diagnostic methods for detection of specific mutations. 
The project leader is one of the inventors of LCR. Thermostable ligases give 
superior performance in LCR and if the specificity of the ligase could be increased 
the sensitivity of detection of mutations could be increased. In as much as LCR 
or variations of it are central to the cancer detection studies proposed in this 
program, this section is central to the entire program since increasing the 
sensitivity of LCR and LDR is seminal, particularly if multiplexing is to be done. 

The construct of new mutants are based on the previous mutagenesis studies by the 
project leader. These studies have led to identification of the adenylation site 
of Tth ligase in the vicinity of residue 118. These studies confirm a conserved 
motif, KVDG, in Tth ligase, suggested by the other workers, to be important in 
adenylation. Other residues are also identified in these studies which are necessary 
for ligase activity but not for adenylation or deadenyl ation. The ability of the 
cloned thermostable Tth ligase to complement an E. coli host containing a temperature 
sensitive mutant 1 igase permits a very nice in vivo assay for functional ity. Also, 
the thermostability of the Tth ligase makes possible a relatively simple partial 
purification procedure for the ligase which permits in vitro characterization of 
the reactions of the mutant enzymes. Finally, a sensitive fluorescent assay for 
1 igase f idel ity has been developed using an Appl ied Biosystems (ABI) DNA sequencer 
employing the ABI Genescan software which permits the detection of products in 
the range of 100 attomoles. Preliminary studies have already led to the interesting 
observation that Tth 1 igase shows greater specificity for perfectly matched substrate 
over mismatched substrate when the mismatch is on the 3' side of the nick. 

Rationale for the design of mutants, particularly in the absence of a structure, 
is a key issue. At this time, the project leader has only sequence comparisons 
and consideration of conserved amino acids to go on. It is noted that no 
consideration seems to have been given if large numbers of mutants were to be 
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screened. This might only be possible by genetic means. Considering the biochemical 
"says available, -which make use of the temperature stability of Tth ligase, only 
up to several thousand of mutants could be screened in a reasonable time but not 
Millions However the project leader's success on two recently isolated mutants 
? show i^roved spec ficUy for TG mismatches suggest that it is not too difficult 
to isolate relevant mutants. In any event, this is not considered a serious 
criticism since the Site Visit Team has a great deal of confidence in the project 
leader's ability to recognize and deal with this problem 1f it occurs. 

Overall the proposed site-directed mutagenesis studies are excellent. Clean and 
sensitive assay methods have been developed that should, at the very least, lead 
to much important information about the functional domains of Tth 1 igase and appear 
to have a good probability of yielding mutants with increased specificity for at 
least some mismatches. 

The second Specific Aim entails the testing of modified oligonucleotides for improved 
specificity during ligation. The-approach is to test the effect of these 
modifications on the specificity of ligation when they occur adjacent to and one 
base over from the discriminating base. If modifications can be found that increase 
this specificity, then potentially the sensitivity of LCR/LDR, and hence their 
diagnostic value would increase. To be tested are base transposing agents called 
"convertides" which are modified so that they have ambiguous hydrogen bonding, 
properties which permits them to base pair with more than one base and in some 
cases with each of the natural DNA bases (A, T. G and C) . The project leader appears • 
-to-be-interested in l-(2'-deoxy- -D-r1bofuranosyl )-3-n1tropyrrole, abbrevi ated 
02 which can pair with all of th"e natural bases. It is noted that many other 
•convertides" are proposed to be made in Project 3 although it is not clearly stated 
whether any of these other than Q2 will be tested in this project. These studies 
rely on supply of the analogues by Drs. Bergstrom and Hammer, from Project 3. 
At the Site Visit, it was reported that Dr. Bergstrom had just delivered to the 
project leader a new modified nucleotide, Q18, not discussed in the application, 
which will be tested. 

The approach taken by the project leader in studying these particular types of 
modified nucleotides is derived from the observations of others that allelic specific 
PCR amplification is enhanced by destabilizing bases near the 3'end I of a primer. 
Most notable here is perhaps the work of Dr. Thilly at MIT in developing KAMA 
technique (KAMA - mismatch amplification mutation assay). The project leader states 
that "This concept of destabilizing the enzyme-nucleic acid complex to significantly 
increase the specificity of a reaction is the basis behind our site-specific 
mutagenesis studies [discussed in several sections of the application]". In the 
rev i ewers' s opinion, the project leader makes a large leap 1n coming to this 
conclusion. Dr. Thilly's work used one or two natural bases at the 3 end of 
a PGR primer-which were mismatches of bases on the template. This is quite different 
from the "convertides" to be used here which have ambiguous hydrogen bonding 
properties that allow them to pair to more than one of the natural bases. There 
is no way to predict the effect of these "convertides." 

The' general ~a-pp roach of testing the effect of modified nucleotides on ligation 
specificity is strongly supported. However, the proposed approach might be too 
narrowly focused on the Q2 nucleotide and the specific observation mentioned above 
that destabilizing bases near the 3' end of a primer enhances allelic specific 
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PCR amplification.- It is difficult to predict the effects of almost any modified 
nucleotide. This section of the project might be strengthened by a broader approach 
including types of.nucleotide modifications other than the "convertides" proposed. 
At the very least, "convertides" other than Q2 should be incorporated into these 
studies as soon as possible. Q2 was designed to be a universal nucleotide and 
a part of this design was to make it lack a hydrogen donor site and maximize stacking 
interactions (see Project 3). All of the other "convertides" proposed have a 
different design principle in that they all have a hydrogen donor site. It is 
not possible to predict how important this difference might be. In the reviewer's 
opinion, it should be addressed as soon as possible. 

Determination of the three dimensional structure of Tth ligase is clearly an 
important problem. The actual work will be done by Dr. Aneel Aggarwal who has 
the experience to perform this study. No one can predict the difficulty in 
determining a protein structure. Even if the structure is not solved, this project 
can yield important information. However, the value of~the information would 
increase greatly if the structure were solved, especially early on. 

Intrinsic Scientific Merit Score: 145 

Administrative Note : Attention is drawn to the potential scientific overlap between, 
the proposed studies by Dr. Francis Barany and those funded at Applied Biosystems 
Inc., on "Ligation Amplification Technology" (2/1/92-1/31/97). 

Personnel : 

Name: Francis Barany 

Degree/Discipline/Date: Ph.D.. microbiology, 1981 
Role/Percent Effort: Project Leader, ten percent 

Qualifications/Experience: Dr. Barany was a postdoctoral fellow with Dr. Hamilton 
Smith at The Johns Hopkins from 1982-1985. He joined the faculty of Cornell 
University Medical College in 1985 as an Assistant Professor and currently holds 
the rank of Associate Professor. He is the author of numerous publications relevant 
to this Project. His research accomplishments include cloning of Tth ligase; he 
is one of the inventors of the Ligase Chain Reaction. 

Assessment in designated role: highly qualified 

Name: Aneel K. Aggarwal 

Degree/Discipline/Date: Ph.D., biophysics, 1984 
Role/Percent Effort: Co- Investigator, ten percent 

Qualifications/Experience: Dr. Aggarwal was a postdoctoral fellow with Stephen 
Harrison at Harvard university. In 1989, he became Assistant Professor of 
Biochemistry and Molecular Biophysics at Columbia University. Dr. AggarwaTs 
postdoctoral research on the structure of phage 434 repressor-DNA complex and his 
more recent work since coming to Columbia University on the structure of BamHl, 
appear to make him well suited to lead the structural studies outlined in this 
"project. " " 

Assessment in designated role: well qualified 



Name: Jianying Luo 
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Degree/Discipline/Date: Ph.D., biochemistry, 1992 
Role/Percent Effort: Research Associate, 50 percent 

Qualifications/Experience: has constructed, isolated, sequenced, and characterized 
proteins from the mutants of Tth ligase gene. Furthermore, she has performed all 
the ligase fidelity assays described in the preliminary results. 

Assessment in designated role: qualified 

Budget : appropriate as requested. 

Assessment : Level of merit; excellent. 

Project 5: Design and Synthesis of DNA and PNA Arrays 
(George Barany, Ph.D.) 

Description : (Applicant's description) The goal of this program project is to 
develop methods for identifying multiple gene mutations in cancers. For maximum 
utility, these methods must be able to recognized and discriminate between dozens 
or hundreds of mutations. 

To accomplish this, we propose to capture specific ligase detection reaction (LDR) 
products on a spatially addressable array, such that the position of a signal . 
identifies a mutation. Each LDR product will have a "zip code" tail, which will 
be selectively captured by a "complementary zip code" on a solid support. The 
complementary components can be DNA oligonucleotides or peptide nucleotide analogues 
(PNA). PNA/DNA hybrids have significantly higher Tm values than DNA/DNA hybrids. 
Incorporation of the nucleotide analogue, 5-propynyl uridine, into DNA zip code 
and PNA address sequences will further increase and optimize Tm values (Project 
3) Unreacted LDR primer may therefore be washed away at high temperatures allowing 
for a higher sensitivity in detecting LDR products. A reusable, universal 
addressable array could be used for detecting a wide range of cancer mutations, 
genetic diseases and infectious agents. 

Implementation of these concepts, with the ultimate goal of achieving reliable 
and efficient materials and procedures that can be incorporated into easy-to-use, 
automated, low-cost diagnostic devices, will follow these aims: (t) Development 
and evaluation of solid support materials compatible with chemical synthesis of 
Dna oligonucleotides and PNA oligomers, and compatible with subsequent hybridization 
reactions. Surfaces, beads, or membranes will be functional ized , and extended 
as needed with hydrophilic spacers such as heterobifunctional polyethylene glycol 
(PEG) and/or carbohydrates. Chemistry for 1 inking ol igomers to the solid support, 
and/or solid-phase assembly of oligomers, will be developed. (1i) Establishment 
of methodology for synthesis of spatially addressable arrays of DNA oligonucleotides 
and PNA oligomers. Appropriate masking technology will expose defined regions 
of the solid support for attachment of pre-formed oligomers, or for chain elongation 
to assemble the needed oligomers. In the latter mode, segment condensation will 
be used when possible in order to provide efficient convergent synthesis, and because 
chemical "failures" will become "invisible", during the subsequent hybridization, 
(iiij Demonstration "of scope and 1 imitations' of zip co'de~c6ncepts " As aims (i) 
and (ii) come to fruition, testing will be carried out (Core B). Design of primer 
and zip code structures will be facilitated by the informatics collaboration (Core 
A). 
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Qritioue : In order to carry out large-scale screening of mutations, this project 
proposes to develop spatially addressable arrays of oligonucleotides or peptide 
nucleotide analogs. Ligase reaction products from the technology devised in Projects 
1 and 2, will be "constructed with fluorescent groups, and will bear specific 
"zip-code" tails. The tails will be selectively captured by complementary zip-code 
probes inroobilized on the array. Each zip-code will map for a specific known genetic 
mutation. 

The concept of zip-codes,-analogous -to the-multiplex detection scheme of -Church 
and Gilbert, has been very well developed in this study. The codes are conceived 
to be unique 24-mers, designed to have very little similarity overlap. A novel 
block synthetic scheme, suited to this design is proposed for PNAs. 36 of the 
256 different tetramers have been chosen on the basis of minimal similarity, and 
will be synthesized as building blocks for 24-mer arrays. Approximately 2 X 10 
9 24-mers could be constructed from this building block set, out of a possible 
approximately 3 X 10 14 24-mers. Initially, five tetramers have been chosen as 
synthetic targets for feasibility, testing of the. array, technology. - - 

Before actual arrays are to be constructed, a significant amount of DNA/PNA array 
development work is proposed. First, solid supports compatible with DNA or PNA 
synthesis will be screened. A variety of potentially compatible materials will 
be investigated, including glass, plastic, cellulose, PEG-PS beads and a variety 
of membranes. The Project Leader has a good command of sol Id-phase synthesis* 
techniques, brings strength to this part of the project, and has put together an - 
impressive l ist of _ surface_functional and_l inker groups. In addition to the 
compatibility with "synthesis, the hybridization compatibility of theses supports 
will also be screened. 

The investigators propose to adapt newly developed PNA chemistry to the 24-mer 
array format. Although PNA/DNA complexes are known to be extremely heat stable, 
it is not clear that this will actually help the detection discrimination issues 
presented in this proposed study. 24-mer DNA complexes are already quite stable, 
and the array design itself preselects optimal 24-mers for minimal cross- 
hybridization. A significant amount of chemistry must accompany the PNA array 
development, and although synthetically efficient and scientifically interesting, 
the central need for PNA arrays for achieving the goals of the program project 
remains unclear. 

The proposed study schematically illustrates a microchannel device to construct 
the DNA or PNA arrays. Embedded in the actual fabrication of the arrays, is an 
enormous amount of engineering and development work; however, noworkplan or budget 
is provided. During the site visit, the investigators deferred this development 
activity to collaborators in laboratories at Slrrus or Millipore. Unfortunately, 
the reviewers have no way to judge the commitment or capability of the investigators 
at Sirrus or Hillipore to meet the priorities of this program project. 

In summary, the strengths of this project lie in the novel investigations related 
to of DNA zip-codes. The concept is well developed, designed and a synthetic 
procedure for-ef f icient construcU^ 

chemistry in this section is outstanding. However, the lack of expertise and 
experimental planning in hybridization, fluorescence detection, and perhaps most 
importantly, the inabil ity to construct the arrays seriously weakens this section. 
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Intrinsic Scientific Merit Score: 244 



Deg?ee/Di!ci^ine/Date: Ph.D., biochemistry, mathematics, organic chemistry, 1977 
Role/Percent Effort: Project Leader, ten percent ♦ ♦u 

£ll1f?St1ons/Exper1ence- Dr. Barany is currently Professor of Chemistry at the 
Sniversity of Minnesota. He an expert on new methods of peptide synthesis, 
onhogonaY protection, onjanosulfur chemistry and various polymer ^nctionil izition 
chemistries He has an outstanding publication record, and is considered to be 
MgKfquanfied to carry out the synthetic and solid-phase chemistry on this 
project. 

Assessment in designated role: highly qualified. 

Name: Josef Vagner . 
Degree/Discipline/Date: Ph.D., biochemistry, 1990 
Role/Percent Effort: Postdoctoral Associate, 100 percent 

Qualifications/Experience: research worker in Prague (1986-1992) with experience 
in solid-phase and solution peptide synthesis, immunology, and protein chemistry. 
Presently, a postdoctoral fellow at the University of Minnesota. 

Assessment in designated role: qualified. 

Budget : The modest budget, as requested, is recommended for approval. 
Assessment : Level of merit; very good to good. 

Core A: Informatic Support for Cancer Detection Methods 

(Neil R. Hacxett, Ph.D.) 

Des cription : (Applicant's description) The goal of this program project is to 
develop techniques that detect multiple cancer mutations, ultimately for the purpose 
of researching the relationship between genetic alterations and tumor behavior, 
and applying these techniques in clinical situations. Managing a database of cancer- 
associated mutations, developing multiplex assays for them and correlating multiple 
cancer mutations with disease outcomes will require a sophisticated level of data 
management. 

Core A will provide informatics support for cancer detection according to the 
following aims: (1) Create and maintain a database of mutations associated with 
cancer, patient history and experimental results. A relational Client/Server 
database will be created on a central facility consisting of a SPARC station^lO 
running the Sybase database management system. Both published reports and results 
from Projects 1 and 2 will be collected, (ii) Analyze database for correlations 
of point mutations with clinical outcome. The significance of the cancer detection 
experiments in Projects 1 and 2 will be assessed by perfonning multivariate analysis 
on given mutations to determine whether they predict clinical outcome. (111) Write 
programs for the choice of primers for PCR/LDR, LDR/PCR and PCR/RE/LDR protocols. 
The programs will be written in C language for use on IBM/ PC or Macintosh computers 
with a simple text-base interface. Thee programs will aid primer design and 
calculation of modified primer Tm values for Projects 1, 2 and 3. (iv). Assist 
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in the design and analysis of oligonucleotide arrays for mutation detection. For 
project 5, potential schemes for array design will be explored to ensure arrays 
of the maximum difference in sequence between every pair of zip codes while 
maintaining a constant melting temperature, (v). Assist in the programming of 
the instruments in the diagnostics and evaluation core and interface these with 
the central database (for Core B). The informatics core will maintain a 
Client/Server database on a SPARC station, and provide programming support which 
is accessible to all participants in the program project. This research may lead 
to correlations between molecular markers and prognosis for lung, colon, breast, 
and cervical cancers. 

Critioue : This critique is based on the assessment of the investigators rather 
than their written or verbal description of the proposed work, since the core is 
vague in a number of areas. 

The investigators have experience in biological sequence analysis, and together 
with Dr. Barany's knowledge of pri»er chemistry, should be well equipped to develop 
primer selection programs (task 3). The reviewers at the site visit asked Dr. 
Hackett specific questions about his strategy for this problem and received a 
response indicating Dr. Hackett's excellent command of this problem. 

The staff has experience with instrumentation and should be able to program and' 
interface the various instruments required by the project (task 5). However, The 
staff in th_i_s_Core have_a very 1 i»ited or Jj^ck.pX experie.nj«.J_n .da.ta base which ■ 
is of concern to the reviewers. The application does not provides details on any 
aspect of the database design, not even examples of specific data that will be 
stored in the database, e.g., specific items of clinical history that will be stored. 
At the site visit. Dr. Hacket presented an example purporting to illustrate his 
approach to cl inical data. The example was unconvincing from a cl inical standpoint 
in that it contained no relevant cl inical data. The computing aspectual so revealed 
a serious technical flaw. (The table on the upper right hand corner contained 
data values that were used as column headings in the table in the lower left, 
violating a standard tenet of relational database design). 

There is no technical discussion of the decision to use SYBASE. The proposal 
mentions the importance of client/server architecture which 1s valuable 1n many 
situations, but it is not clear that 1t is important 1n this study. SYBASE 1s 
a compl icated product, and the staff appears to have no experience with It. Perhaps 
a simpler MaC-based product, such as Fourth Dimension, would be a better choice. 

It is unclear as to why this project needs to maintain a database regarding cancer 
causing mutations, except for the mutations being analyzed in Projects 1 and 2. 
The project leaders of Projects 1 and 2 are well aware of the 1 Iterature and on-going 
work related to the genes being studied in these projects, and are unlikely to 
require a database of all cancer causing mutations. The investigators may want 
to do a literature search to see 1f they have missed any new reports of relevant 
mutations, but this is best done by a search of HedLine and public databases using 
standard software. 

The complex issues involved in clinical data management were not discussed at the 
site visit. Dr. Osborne volunteered that he and his co-workers at Strang have 
extensive experience in this area and would be delighted to help out. Although 
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this is viewed positively, this involvement should have been explicitly described 
in the application. 

No technical discussion on how the Strang clinical database will be translated 
to the proposed database. It is not clear if the proposed database will use the 
same schema as Strang. If not. some translation effort is needed to bridge the 
differences. 

Additionally, how data would be moved from Colorado to the proposed database is 
also unclear. Is the Colorado data is available on computer already, then 
translation issues similar to those mentioned above must be addressed. If not, 
substantial effort will probably be required to extract and computerize this 
information based on patient records. 

Experience in Biostatistics area is yet another concern. The investigators provide 
no details on biostatistical aspect of the clinical design, nor is there any 
discussion of statistical methods that will be used to analyze the data. At the 
site visit, Dr. George Wong, a bio-statistician at Strang, expressed his enthusiasm 
for the proposed work, but confirmed that he has not been involved in the project 
to date. The involvement of a biostatistician is essential for the design of the 
proposed clinical studies and the analysis of the results. 

Project 1 includes some discussion of data management and statistical analysis- 
for data collected in Colorado. There seems to be no coordination between the 
Colorado effort and the Informatics Core. These two efforts propose to use different 
types of computers (Mac versus Unix), different database management systems (Paradox 
versus SYBASE), and different statistical packages (SYSAT and SAS versus SPSS). 

Personnel : 

Name: Neil R. Hackett 

Degree/Discipline/Date: Ph;0.. biochemistry , 1982 
Role/Percent Effort: Core Leader, 20 percent 

Qualifications/Experience: postdoctoral training in the Department of Chemistry 
at MIT; Assistant Professor in the Department of Molecular Biology at Vanderbilt 
University for three years, and since 1989 has been Assistant Professor in the 
Department of Microbiology at Cornell University. Since 1990, also served as the 
Manager of the Molecular Biology Computing Facility at Cornell University. He 
has several publications in Molecular Biology, but lists no publications related 
to informatics. 

Assessment in designated role: qualified. 
Name: Aaron Giles 

Degree/Discipline/Date: B.S., physics, 1992 
Role/Percent Effort: Programmer, 100 percent 

Qualifications/Experience: worked as a programmer at the High Energy Physics 
Department, University of Chicago through June of 1993 at which time, he joined 
the Office of Academic Computing at the Cornell University. His CV suggests 
experience in software development at the hardware/software boundary, such as 
firmware for attached laboratory devices, print drivers, and image viewing software. 
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Assessment in designated role: qualified. 

Budget : The operating system ($5,000) is included in the "Equipment" category 
and is thus deleted from the 'Supplies' category. The budget of $2,500 in the 
"Other Expenses" category is not justified and therefore deleted. Due to the 
excessive requested budget in the years 02-05, the costs for "Equipment" (-$2,000), 
and for maintenance in the "Supplies' categories (-$1,000) are recommended to be 
reduced. Additionally, the cost for 'Other Expenses' (-$2,500), is recommended 
to be deleted as it is not justified. 

Assessment : Level of merit; acceptable. 

Core B: Instrumentation and Mutation Detection 

(Francis Barany, Ph.D) 

Description : (Applicant's description) Correlations of multiple cancer mutations 
with disease outcome will require the ability to perform high throughput mutation 
detection. The goal of this core is to provide the instrumentation and mutation 
detection support required to achieve large scale identification and analysis of 
mutations. Core B will work closely with Core A, the informatics support for cancer 
detection methods. 

This Core will have the following responsibilities: (i) Providing instrumentation 
fpr.ol igonucleotide synthesis and analysis of cancer causing mutations. The PCR/LDR, * 
LDR/PCR, and PCR/RE/LDR experiments described in Projects 1 and 2 require 
synthesizing large numbers of oligonucleotides. The products from these cancer 
detection amplifications willbe separated and quantified on an ABI 373A DNA 
sequencer. By the third year we plan to automate some of the PCR/RE/LDR steps 
using a robotics workstation, (ii) Testing the efficiency and polymerase fidelity 
of nucleotide conversions using convertide oligonucleotides. The PCR/RE/LDR cancer 
detection scheme is dependent on the f idel ity of thermostable polymerase extension 
off primers containing a 3' nucleotide analogue (Project 3). Using an assay we 
developed, the Core will test both the efficiency and fidelity of different 
polymerases for each base conversion. (iii) Testing oligonucleotide or PNA 
addressable arrays for quantitative cancer mutation detection. Large scale detection 
of a multitude of mutations will require addressable arrays. Nutations will be 
distinguished by the position of a fluorescent signal on the array. The Core will 
test arrays synthesized in Project 5 for fluorescent detection of LDR and LCR 
oligonucleotide products, using a Molecular Dynamics Fluorlmager 575. 

Critique : Core B is considered as an essential part of the program project. The 
specific responsibilities include providing oligonucleotides to the rest of the 
program, robotics support for the PCR/RE/LDR assays, testing the efficiency and 
polymerase f idel ity of nucleotide conversions, and testing of the oligonucleotide 
or PNA arrays for quantitative cancer mutation detection. 

With respect to ol igonucleotide synthesis, an upgraded Model 394 automated 4- column 
synthesi2er donated by ABI is available for producing-the-required oligonucleotides. 
This is now conventional technology which should not pose any difficulties. The 
investigators in this group have experience with gel-purified oligonucleotides, 
which represents valuable know-how as purity of probes and primers will be a critical 
issue. 
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Currently products from the experiments described in projects 1 and 2 are separated 
an7q U antif[ed on-a Model 373A DNA sequencer which has also been donated by AB1 
and is more or less turn-key technology. The concept of using, for example, 
nexaethylene oxide "tails' to differentiate multiplexed PCR/LDR and related 
iSifiwtlSn products has already been proven in referenced publications 2 and 
3 P There is ample precedent for the use of capillary electrophoresis to achieve 
the proposed separations. The proposed "zip code" concept for the PNA arrays 
should be an effective detection scheme if implemented into arrays. The Beckman 
Biomek or ABI Catalyst 800 robotics equipment for pipeting, etc. seems reliable 
and justifiable. 

In this Core, the efficiency and polymerase fidelity of nucleotide conversions 
using convertide oligonucleotides provided by Project 3 will be Jested The 
aoDl icant has devised a series of assays to determine how well a nucleotide analogue 
can "read" a natural base, which were considered sufficient by the review committee. 

Finally oligonucleotide or PNA addressable arrays for quantitative cancer detection 
will be tested. Test arrays are to be provided by Project 5 and hybridization 
will be detected using a Holecular Dynamics Fluorlmager 575 to be purchased in 
year-1 for $80,000. This system will likely require a substantial amount of 
unbudgeted software development. The hybridization assays will also require a 
substantial amount of optimization, especially if quantitation is desired. Overall,, 
the core group could benefit from expertise in software and instrumentation 
development. 

Personnel : 

Name: Francis Barany . 

Qualifications described earlier under Principal Investigator and Project 4. 
Assessment in designated role: highly qualified. 
Name: Matthew Lubin 

Qualifications described earlier in Project 2. 
Assessment in designated role: well qualified. 
Name: Jianying Luo 

Qualifications described earlier in Project 4. 
Assessment in designated role: qualified. 

Budget : The modest budget is approved as requested, except that the level of eff °J*J 
for the research technician is recommended to be changed from six months at 100 
percent effort to one year at 50 percent effort. 



Assessment : Level of merit; excellent. 
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Core C: Administrative Core 

(Francis Barany, Ph.D.) 

pescription : (Apffl icant 's description) The structure of the administrative core 
has been designed to meet the planning and managerial needs of the program project. 
It is also aimed at combining the expertise of all institutional administrators 
and scientific program directors in order to enhance the proficiency of the research 
level in order to promote cooperative efforts at all collaborating institutions. 

The administrative responsibility of the Administrative core will include: 1) 
monitoring the scientific and administrative diligence of each of the component 
projects and re-allocating resources when and if necessary, 3) schedul ing meetings 
of group investigators to be held approximately three time a year, 4) to keep the 
NIH Scientific Program Director/Coordinator apprised of group progress, changes 
in scientific aims, personnel, etc. 5) preparing progress reports for the NIH 6) 
assuring that core resources are providing adequate support to all projects, 7) 
coordinating group activities with all external companies in keeping with the formal 
collaborative research arrangements, and 8) monitoring inventions and invention 
disclosures supported by the program. In this fashion, the Administrative core 
will enhance maximum coordination between investigators participating 1n the program 
and facilitate the research objectives of the program project. 

Critique : Dr. Barany's past research accomplishments demonstrate his ability to 
administer effectively individual R01 projects. Although his record of administra- 
tive experience other than that required for individual projects is only recent 
and limited, his success to date in assembling the program project key Investigators 
is evident. His collaboration since 1991 with Dr. Wilson has resulted in a co- 
authored publication. He successfully initiated research discussions with Drs. 
Aggarwal, Hackett, and Lubin that led to the plans proposed as Project 2, Project 
4, Core A, and Core B. He recruited Dr. Wilson for involvement in this program 
project. He has secured the commitment of nine potential members of an external 
advisory panel . 

The plan to conduct productivity evaluations at formal annual meetings 1s 
appropriate. Indicators to monitor progress and expenditures are not established. 
Decision to date appear to rely on his personal communications with others. 
Effective interchange requires a structure commitment to frequent convening of 
the collaborators as a group to discuss current considerations; however, the absence 
of a plan for such sessions between annual meetings raises a serious concern. 

A process for allocation and reallocation of funding cited as a responsibility 
within this Core is not documented. A role for the Administrative Core in 
contributing to the management of resources at the project level 1s not defined. 
The program appears to rely largely on effective administration at the level of 
project and core leaders, with the Administrative Core responsible primarily for 
interproject cormuni cat ions and reporting. 

The issue of potential changes in leadership presents some concerns. A single 
successor to Dr. F. Barany as the Principal Investigator is not obvious. In the 
absence of publ ished work in this field, it is not apparent that Dr. Lubin (Project 
2) nor Dr. Aggarwal (Project 4) could conduct these projects as proposed in his 
absence. The plan to include external advisors in internal advisory meetings is 
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commendable. The application provides letters of commitment from the proposed 
external advisors," although these letters do not explicitly confirm the expectation 
to attend an annual meeting. To maximize effectiveness of the annual review, plans 
are needed regarding preparations for this session and follow-up on subsequent 
recommendations. The numerous letters of support from other individuals interested 
in development of the program actually raise concerns about the focus and priorities 
of the planned studies. 

Personnel : 

Name: Francis Barany 

Degree/Discipline/Date: Ph.D., uicrobiology, 1981 
Role/Percent Effort: Core Leader, five percent 

Qualifications/Experience: Dr. Barany is personally involved in the current research 
collaborations between Project 2 and 4, and Cores A and B. He personally has secured 
the conmitment of potential members of the external advisory panel. Dr. Barany 1 s 
biographical sketch does not indicate any administrative experience to date beyond 
leadership of an R01 grant. His recruitment of an experienced administrator as 
Co-Leader of the Administrative Core shows good judgment. 

Assessment in designated role: qualified 

Name: Michael J. Bunk 

Degree/Discipline/Date: Ph.D., nutritional biochemistry, 1980 
Role/Percent Effort: Core Co-leader, 5(YR-01) to 10(YR-02-05) 
Qualifications/Experience: Director, Research Resources Management, Strang Cornell 
Cancer Prevention Center since 1993; previously Assistant Program Director, Clinical 
Nutrition Research Unit (19B7-89), Director of Foundation Relations (1989-91), 
and Senior Grants Management Specialist (1991-93) at Memorial Sloan-Kettering. 
Dr. Bunk has served for one year, at an affiliate (Strang) of the applicant 
organization, in a- title-very- s imi-VaT-to--the proposed role. The application, 
however, does not describe his current duties nor the relevance of this experience 
to his proposed role. He has a total of six years of potentially relevant prior 
administrative experience in three different programs outside the applicant organiza- 
tion. The current appl ication and the collaborations developed to date demonstrate 
administrative skill, but Dr. Bunk's specific contribution to date to these 
preparations to evaluate the percent effort requested. 

Assessment in designated role: qualified. 

Budget :— A- number of administrative systems will need to be developed and established 
immediately in Year -01. In the absence of evidence that the administrative 
responsibilities will increase dramatically from the Year 02, it is recommended 
that the Program Coordinator's effort should remain at 30 percent during this time. 
The remaining budget is appropriate as requested. 

Assessment : Level of merit; good to acceptable. 

Women and Minorities in Study Population : All of the proposed clinical materials 
will come from existing tissue banks from which individual patient identification 
cannot be made. Therefore this proposal is considered exempt from Women and Minority 
considerations. 
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Project 1 (Dr. Francis Barany, Cornell University Medical College 
BUDGET CATEGORIES Requested Recommended 
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THIRD PARIY COSlii 

Direct 

Indirect 



71,545 
28,959 



TDTAT 



106,564" 



71,545 
28,959 



166,504 



Project 1 (Dr. FrancisBarany, Cornell University Medical College 
(The Children's Hospital)) 



BUDGET CATEGORIES 



Requested 



Recommended 



54,456 
2,595 

10,300 
1,200 



28,959 
3.000 



PERSONNEL 
EQUIPMENT 
SUPPLIES 
TRAVEL-Domestic 
THIRD PARTY COSTS 
Indirect Costs 
42 percent 
OTHER EXPENSES 



54.456 
2.595 

10.300 
1,200 



28,959 
3.000 



166.564 



TDTAT 



166.564 



Project 2 

BUDGET CATEGORIES 



Requested 



Recommended 



64.666 
14.500 
1.200 
3,000 



PERSONNEL 
SUPPLIES 
TRAVEL-Domestic 
OTHER EXPENSES 



64,660 
14,500 

1.200 
3.000 



103.560 



TOTAL 



163.566 



Project 3 (Dr. Francis Barany, Cornell University Medical College 
BUDGET CATEGORIES Requested Recommended 



THIRD PARTY COSIS 

Direct 

Indirect 



131,170 
60.888 



131,170 
60,888 



T6TAT 



192.656 192.056 
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BUDGET CATEGORIES 



Requested 



Recommended 



PTTOTNTl" 
SUPPLIES 
TRAVEL-Domestic 
THIRD PARTY COSTS 
Indirect Costs 
51 percent 
OTHER EXPENSES 



44,305' 
13,000 
1,200 



33.663 
7.500 



44,305 
13,000 
1,200 



33,663 
7,500 



TDTAT 



99,666" 



99,666 



Project 3 (Dr. Robert P. Hammer, Louisiana State University) 



BUDGET CATEGORIES 



PERSONNEL 
EQUIPMENT 
SUPPLIES 
TRAVEL-Domestic 
THIRD PARTY COSTS 
Indirect Costs 
45 percent 
OTHER EXPENSES 



Requested 



40,606" 
4,665 

14,000 
1.200 



27.225 
4.500 



-TDTAT 



92.390 



Recommended 



40,600 
4,665 

14,000 
1,200 



27,225 
4,500 



92.390" 



Project 4 (Dr. Francis Barany, Cornell University Medical College) 



BUDGET CATEGORIES 



Requested 



Recommended 



PERSONNEL 
SUPPLIES 
TRAVEL-Domestic 
THIRD PARTY COSTS 
Direct 
Indirect - 
OTHER EXPENSES 

TOTAT 



37.647 
10.500 
1.000 

60.918 
40,815 
3.000 

154.086 



37.847" 
10.500 
1,000 

60.918 
40.815. 
3.000 

154,080 
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Project 4 (Dr. Aneel Aggarwal, College of Physicians and Surgeons 
of Columbia University) 



BUDGET CATEGORIES* 


Requested 


Recommended 


PERSONNEL 


45,516 


45,918 


SUPPLIES 


10,500 


10,500 


TRAVEL-Domestic 


1.500 


1.500 


THIRD PARTY COSTS 


40,815 


40.815 


OTHER EXPENSES 


3,000 


3,000 


TOTAL 


101,733 


101,733 


Project 5 






BUDGET CATEGORIES 


nequesiea 


Keconrn^naea 


PERSONNEL 


64,069 


L A A^a 


SUPPLIES 


15.500 


15,500 


TRAVEL-Domestic 


1.200 


1.200 


THIRD PARTY COSTS 






Indirect Costs 


85.769 


85,769 


40 percent Direct 


34.308 


34,308 


OTHER EXPENSES 


5,000 


5,000 


TOTAL 


120,077 


120,077 


Core <A 






BUDGET CATEGORIES 


Requested 


Recommended 


PERSONNEL 


70.488 


70.466 


EQUIPMENT 


15,000 


15.000 


SUPPLIES 


16.500 


11,500 


TRAVEL-Domestic 


1.200 


1,200 


OTHER EXPENSES 


2.500 


0 


TOTAL 


105.688 


96.166 


Core B 






BUDGET CATEGORIES 


Requested 


Recommended 


PERSONNEL 


50,636 


60.636 


EQUIPMENT 


80.100 


80,100 


SUPPLIES 


11.000 


11,000 


OTHER EXPENSES 


5,000 


5.000 


TOTAL 


156.736 


156.736 
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Core C 

BUDGET CATEGORIES- Requested Recommended 

PERSONNEL — ZZqjK 

TRAVEL-Domestic 9,000 9,000 

OTHER EXPENSES 5,500 5,500 

TDTAT ~ 3*7335 

SUMMARY RECOMMENDED BUDGET* 



BUDGET CATEGORIES 


01 Year 


02 Year 


03 Year 


04 Year 


05 Year 


Personnel 
Consultant Costs 
Equipment 
Suppl ies 

Travel (Domestic) 
Third Party Costs 
Direct Costs 
Indirect Costs 
Other Expenses 


276,669 
1,000 
95.100 
47.500 
12,400 

349,402 
164,970 
16,500 


343,663 
1,000 
6.000 
43.500 
12,400 

350,142 
164.970 
16,500 


343,603 
1,000 
76.000 
43,500 
12.400 

350.142 
164.970 
16,500 


343,603 
1,000 
6,000 
43,500 
12.400 

350.142 
164,970 
16.500 


343.803 
1,000 
6,000 
43,500 
12.400 

350.142 
164.970 
16.500 


TOTAL 


963.S41 


936.315 


1,006,315 


936.315 


936.315 



♦Appropriate escalation factors to be added at the time of an award. 
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CORNELL UNIVERSITY MEDICAL COLLEGE 

Department of Microbiology, Box 62 
1300 York Avenue, New York, NY 10021 
Telephone: (212) 746-6509 Fax: (212)746-8587 



November 11. 1994 

Dr. James Jacobson 
National Cancer Institute 
Executive Plaza North 
Rm 513 

61-30 Executive Blvd. 
Rockville, MD 20892 
Tel.# 301-496-1591 



Dear Jim, 

I appreciate our frank discussion on Friday, October 28th, and am following up on your 
suggestion to send you a brief letter outlining the issues and concerns of the co-investigators of our 
program project grant entitled "New Methods for Cancer Detection." We are strong supporters of peer 
review, and have found review of our own proposal (not funded) both fair and very constructive. We are 
writing this letter because we would like to preserve the sanctity of NIH and NCI review. 

An event of the last week has reopened our concerns that there were hidden conflicts with the 
review of our program project grant. One of our colleagues at the recent American Society for Human 
Genetics meeting reported that a group from Affymetrix, a for profit corporation of Santa Clara, CA, 
presented work wherein they are using DNA ligase to join oligonucleotides hybridized on a DNA chip. 
In particular they claim that perfectly matched hybridization units are covalently joined and those in 
which there is as little as a single base mismatch at the ligation junction are not. They claim this offers 
greater discrimination between perfect hybridization units and single nucleotide mismatches, because the 
unligated labeled probe molecules can subsequently be washed away. 

It is our opinion the above presentation constitutes a substantial overlap with the contents and 
ideas presented in our NCI program project grant This material was made available to the NCI site visit 
team, which included Dr. Stephen Fodor, Scientific director and Chief Technical Officer of Affymetrix Inc. 
on April 29th, 1994 in a 448 page document, as well as discussed in detail during a 6 hr site visit held at 
Cornell University on June 1st, 1994. 

We were informed of the members of the site visit team via Fax only on April 29th, 1994, at which 
time grants were already sent to the reviewers. At the time, we expressed our concerns to Drs. Devi 
Vembu, David Irwin, Shiela Taube, and yourself, that certain members of the committee, among them Dr. 
Stephen Fodor, may have conflicts of interest. One such member was removed, but Dr. Fodor was not. 
You assured us that the NCI would vigilantly guard against any conflicts of interest problems. 

During the eight months that we organized and wrote our NCI program project grant, we 
realized that DNA arrays (Project 5) would be an important part of the proposal. At the time of our grant 
submission in February 1994, we were aware that competing groups from Affymetrix and Beckman were 
developing DNA arrays, however no detailed description of their work appeared in the peer reviewed 
literature. Since that time, two reports on DNA arrays have appeared, Pease et al. (with S. Fodor as 
senior author), Proc. Natl. Acad. Sci. 91: 5022-5026 in May 1994, and Eggers et al. (HARC-Beckman group) 



Biotechniques, 17:516-524 in September 1994. It is clear that the DNA arrays as described in the 
Affyrnetrix PNAS report would not be sensitive enough to detect cancer mutations as properly described 
in our program project grant 

Neither of these reports mentioned use of a DNA ligase for discriminating single base 
differences. Indeed, during the site visit one of us (F.B,) specifically addressed a question from Dr. Fodor, 
explaining to him that the specificity of a ligase enzyme will always give greater discrimination of a 
single base difference than hybridization alone. 

The above concepts are key to reducing array based DNA diagnostics to practical use. As a direct 
consequence of one of our (F.B.) publications introducing LCR and LDR in the peer reviewed literature in 
1991, ligase based detection methods have now become an area of intense competition between academic 
and industrial research teams. Affyrnetrix clearly has a substantial financial interest in this area, as 
evidenced by a $30,000,000 grant awarded by the National Institutes of Standards and Technology (NIST) 
in October 1994. The application for this award was likely prepared during the time of our program 
project site visit, since the NIST program was announced on April 25, 1994, and applications were due 
about two months later. (Affyrnetrix had access to our grant application from April 29, 1994 to June 1, 
1994.) Our NCI proposal contained a substantial amount of confidential information relating to arrays 
and ligase detection technology. Based on their recent ASHG presentation, we believe the Affyrnetrix 
NIST proposal may have substantial overlap with the aims of our NCI program project grant proposal. 

In order to determine if scientific ethics were breached, we feel it is appropriate that there be a 
thorough evaluation of the Affyrnetrix NIST proposal for overlap with our NCI program project grant. 
To avoid similar potential problems of scientific ethics when we resubmit our own NCI proposal, we feel 
it would be inappropriate for us to perform this thorough evaluation. We wish to request that either you 
or an impartial third party chosen by you conduct a comparative evaluation of the Affyrnetrix NIST 
proposal and our NCI program project proposal. In particular one should investigate whether 
Affyrnetrix proposed using thermostable ligase or borrowed intellectual and design concepts from our 
zip-code capture array. If there is overlap that involves plagiarism of intellectual property, then we 
would like a formal investigation. 

We know that you and NCI are fully committed to maintaining the highest level of integrity 
during the review process and shall assist you in any way possible. 

Sincerely yours, 



Francis Barany 

Associate Professor of Microbiology 
Tel.# 212-746-6509 



Donald E. Bergstrom 

Professor of Medicinal Chemistry 

Tel.# 317-494-6275 



c.c. Dr. Shiela Taube 

c.c. Dr. Robert Strausberg 



